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CN ■ Abstract 

i The inspection of residuals is a fundamental step to investigate the 

r l) ■ quality of adjustment of a parametric model to data. For spatial 

point processes, the concept of residuals has been recently proposed 
£f) ■ by Baddeley et al. Baddeley et al. (2005) as an empirical counterpart 

of the Campbell equilibrium equation for marked Gibbs point processes. 
The present paper focuses on stationary marked Gibbs point processes 
and deals with asymptotic properties of residuals for such processes. In 
particular, the consistency and the asymptotic normality are obtained 
for a wide class of residuals including the classical ones (raw residu- 
als, inverse residuals, Pearson residuals). Based on these asymptotic 
results, we define goodness-of-fit tests with Type-I error theoretically 
controlled. One of these tests constitutes an extension of the quadrat 
counting test widely used to test the null hypothesis of a homogeneous 
■ Poisson point process. 
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1 Introduction 

Recent works on statistical methods for spatial point pattern makes parametric inference feasible 
for a wide range of models, see M0ller (2008) for an overview of this topic and more generally the 
' books of Daley and Vere- Jones (1988), Stoyan et al. (1987) M0ller and Waagcpetersen (2003) or 

Illian et al. (2008) for a survey on spatial point processes. The question is then to know whether 
the model is well-fitted to data or not. For classical parametric models, this is usually done 
via the inspection of residuals. They play a central role in parametric inference, see Atkinson 
(1985) for instance. This notion is quite complex for spatial point processes and has been re- 
cently proposed by Baddeley et al. Baddeley ct al. (2005), following ideas from a previous work of 
Stoyan and Grabarnik Stoyan and Grabarnik (1991). 

The definition of residuals for spatial point processes is a natural generalization of the well- 
known residuals for point processes in one-dimensional time, used in survival analysis (see Fleming and Harrington 
(1991) or Andersen ct al. (1993) for an overview). For example, a simple measure of the adequacy 
of a one-dimensional point process model consists in computing the difference between the number 
of events in an interval [0,t] and the conditional intensity (or hazard rate of the lifetime distri- 
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bution) paramctrically estimated and integrated from to t. The extension in higher dimension 
requires further developments due to the lack of natural ordering. It may be done for point pro- 
cesses admitting a conditional density with respect to the Poisson process. These point processes 
correspond to the Gibbs measures. The equilibrium in one dimension between the number of events 
and the integrated hazard rate may be replaced in higher dimension by the Campbell equilibrium 
equation or Georgii-Nguyen-Zessin formula (see Gcorgii (1976), Nguyen and Zessin (1979a) and 
Section 2.3), which is the basis for defining the class of h— residuals where h represents a test func- 
tion. In particular, Baddcley et al. Baddeley et al. (2005) consider different choices of h leading 
to the so-called raw residuals, inverse residuals and Pearson residuals, and show that they share 
similarities with the residuals obtained for generalized linear models. 

Thanks to various diagnostic plots developped in the seminal paper Baddcley et al. (2005) and 
implementation within the R package spatstat Baddcley and Turner (2005), residuals appear to 
be a very convenient tool in practice. Some properties of the residuals process are exhibited in 
Baddcley ct al. (2005) and Baddcley et al. (2008), including a conditional independence property 
and variance formulae in particular cases. In these two papers, the authors conjecture that a strong 
law of large numbers and a central limit theorem should hold for the residuals as the sampling 
window expands. 

Our paper addresses this question for d— dimensional stationary marked Gibbs point pro- 
cesses. We obtain the strong consistency and the asymptotic normality in several contexts for 
a large class of test functions h. The h— residuals crucially depend on an estimate of the pa- 
rameter vector. We consider the natural framework where the estimate is computed with the 
same data over which the h— residuals are assessed. The assumptions are very general and we 
show that they are fulfilled for several classical models, including the area interaction point pro- 
cess, the multi-Strauss marked point process, the Strauss type disc process, the Geyer's triplet 
point process, etc. The assumptions on the estimator are quite natural and we show that they 
are fulfilled in particular by the maximum pseudolikclihood estimator (in short MPLE) (see 
Baddcley and Turner (2000) for instance), for which asymptotic properties are now well-known (see 
Jensen and M0ller (1991), Jensen and Kunsch (1994), Billiot et al. (2008), Dereudre and Lavancier 
(2009) and Coeurjolly and Drouilhet (2009)). 

Moreover, based on these asymptotic results, we propose statistical goodness-of-fit tests for 
which the Type-I error is asymptotically controlled. To the best of our knowledge, this is the 
first attempt in this direction. Such tests exist for rejecting the assumption of a homogeneous 
or inhomogeneous Poisson point process, but for general marked Gibbs point processes, the ex- 
isting validation methods are either graphical (for example by using the QQ-plot proposed by 
Baddcley et al. (2005)) or rely on Monte-Carlo based simulations. We present two tests based on 
the computation of the residuals on different subdomains of the observation window. They extend 
in a very natural way the quadrat counting test for homogeneous Poisson distributions (see Digglc 
(2003) for instance). Besides, we present a test which combines several different h— residuals (asso- 
ciated to different functions h) , computed on the entire observation window. The next step will be 
to implement these testing procedures to assess their power, compare them and reveal their limits. 
A thorough study will require extensive simulations and should deserve a separate paper. 

The rest of the paper is organized as follows. Section 2 gathers the main notation used in 
this paper and briefly displays the general background. The definition of marked Gibbs point 
processes is given. They depend exclusively on the choice of an energy function or equivalcntly, 
of a local energy function. All the assumptions are based on this function. The Georgii-Nguyen- 
Zessin formula is recalled, leading to the definition of the h— innovations and h— residuals. Some 
examples are presented including the classical residuals considered by Baddcley ct al. (2005) and 
new ones connected to the well-known empty space function (or spherical contact distribution), 
denoted in the literature by F, see M0ller and Waagepetersen (2003) for instance. 

Section 3 deals with asymptotic properties and presents our main results. A parametric station- 
ary d-dimensional marked Gibbs point process is observed in a domain, denoted by A„, assumed to 
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increase up to R . Sufficient conditions expressed in terms of the test function and the local energy 
function are given in order to derive the strong consistency result. We also propose an asymptotic 
control in probability of the departure of the /i-rcsiduals process from the /i-innovations through 
the departure of the estimate from the true parameter vector (see Proposition 3). This allows us 
to deduce asymptotic normality results. Two different frameworks are considered: for the first 
one, the initial domain is splitted into a fixed finite number of subdomains (with volumes aimed 
at converging to +00) and we consider the vector composed of the h— residuals computed on each 
subdomain. For the second framework, we consider the vector composed of the hj— residuals (for 
j = 1, . . . , s) computed on the same domain A„, where hi, . . . , h s are different test functions. 

The asymptotic normality results depend on unknown asymptotic covariance matrices. The 
important question of estimating these matrices is addressed in Section 4. We give a general 
condition under which these matrices are definite-positive and propose a consistent estimate. 

Section 5 exploits the asymptotic results obtained before. Some goodness-of-fit tests are pro- 
posed, based on normalized residuals computed in the two previous frameworks. They are shown 
to converge to some \ 2 distribution. Framework 1 leads to a generalization of the quadrat counting 
test for homogeneous Poisson distributions. Framework 2 yields a test which combines the infor- 
mation coming from several residuals, as for instance residuals coming from the estimation of the 
empty space function at several points. 

The different assumptions made in the previous sections to obtain asymptotic results are dis- 
cussed in Section 6. When considering classical test functions, exponential family models and the 
MPLE, the regularity and intcgrability type assumptions are shown to be satisfied for a wide class 
of examples. The testing procedures require moreover an identifiability condition to provide a 
proper normalization. Proposition 16 shows that this condition is easy to check for the first pro- 
posed test and appears to be not restrictive in this case. For the other tests, checking this condition 
depends more specifically on the model and the test function. We show, in Proposition 18, how 
this condition can be verified on two examples of models with several choices of test functions. 

In Section 7, the very special situation where the energy function is not hereditary is considered. 
The GNZ formula is not valid any more in this setting but, provided a slight modification, it has 
been recently extended in Dereudre and Lavancier (2009). This leads to a natural generalization 
of the residuals to the non-hereditary setting. 

Proofs of our main results are postponed to Section 8. The main material is composed of 
an ergodic theorem obtained by Nguyen and Zessin (1979b) and a new multivariate central limit 
theorem for spatial processes. Our setting actually involves some non stationary conditional cen- 
tered random fields. A general central limit theorem adapted to this context has been obtained 
in Comets and Janzura (1998) for self-normalized sums (see also Jensen and Kiinsch (1994) in the 
stationary case and without self- normalization) . But, contrary to these papers where the obser- 
vation domain is assumed to be of the form [—n,n] d , we consider domains that may increase 
continuously up to M. d . This particularity, which seems more relevant, requires an extension of the 
results in Comets and Janzura (1998) and Jensen and Kiinsch (1994) to triangular arrays. This 
new central limit theorem is presented in Appendix A. 

2 Background on marked Gibbs point processes and defini- 
tion of residuals 

2.1 General notation, configuration space 

We denote by B(R d ) the space of bounded Borel sets in R d . For any A 6 B(M. d ), A c denotes the 
complementary set of A inside R d . The norm |.| will be used without ambiguity for different kind of 
objects. For a vector x, |x| represents the uniform norm of x; For a countable set J ' , represents 
the number of elements belonging to J; For a set A G B{W l ), |A| is the volume of A. 



3 



Let M be a matrix, we denote by ||M|| the Frobenius norm of M defined by ||MI| 2 = Tr(M T M), 
where Tr is the trace operator. For a vector x, ||x|| is simply its euclidean norm. 

For all x G R d and p > 0, 2?(x, p) :~ {y, |y — x| < p). Let us also consider the short notation, 
for i G Z d , B^p) = B(i, p) n Z d . 

The space K d is endowed with the Borel er-algebra and the Lebesgue measure A. Let M be a 
measurable space, which aims at being the mark space, endowed with the cr-algebra M. and the 
probability measure A m . The state space of the point processes will be $ := t d x M measured by 
p, := A ® A™. We shall denote for short x m = (x, m) an element of S. 

The space of point configurations will be denoted by f2 = This is the set of simple 

integer- valued measures on $. It is endowed with the a-algebra T generated by the sets {(p G 
fl, ip(A x A) = n} for all n G N, for all A G M and for all A G B{R d ). For any x m G S and ip G Q, 
we denote x m G <p if <p(x m ) > 0. For any <p G O and any A G £>(IR d ), we denote <pa ■— ^Axm 
the projection of tp onto A x M, which is just the mesure 2x m evn(AxM) ^ m > wnere S x is the 
Dirac measure at x. We will use without ambiguity some set notation for elements in f2, e.g. 
ipU{x m } = ipUx m := ip + 5 x , n and for x m G <p, ip\{x m } = ip\x m := (p-S xm . For any A G B(R d ), 
the number of elements of p\ is denoted by \cp\\ := tp(A x M). 



2.2 Marked Gibbs point processes 

The framework of this paper is restricted to stationary marked Gibbs point processes. Since we 
are interested in asymptotic properties, we consider these point processes on the infinite volume 
R d . Let us briefly recall their definition. 

A marked point process $ is a fi-valued random variable, with probability distribution P on 
(£1, J 7 ). The most prominent marked point process is the marked Poisson process ~k v with intensity 
measure v on M. d (and mark density A m ). The homogeneous marked Poisson process arises when 
v = zA, with z > 0. 

Let 9 G ©, where is some compact set of K p (for some p > 1). For any A G B(R d ), let us 
consider the parametric function V\(.;8) from into K U {+oo}. For fixed 9, (Va.(.; #))AeB(K <i ) 
constitutes a compatible family of energies if, for every A C A' in B(M. ), there exists a measurable 
function ^a,A' from into R U {+00} such that 

V<^Gfl Vk-(wO) = Vk{w9) + iI)k#(<pso-,8). (1) 

From a physical point of view, Va(<^a; 9) is the energy of pa in A given the outside configuration 
(p\c . The following definition is the classical way to define Gibbs measures through their conditional 
specifications (see Preston (1976)). 

Definition 1. A probability measure Pg on £1 is a marked Gibbs measure for the compatible family 
of energies (Va(.; #))AeB(R d ) an d the intensity v if for every A G B(M. d ), for Pg- almost every outside 
configuration tp\c , the law of Pg given <pa c admits the following conditional density with respect to 
ir u : 

h{<PA\<PA-;0) = 1 e- y ^ e \ 
Za{pa-;9) 

where Za(<Pa c ]9) is a normalization called the partition function. 

The existence of a Gibbs measure on ft which satisfies these conditional specifications is a diffi- 
cult issue. We do not want to open this discussion here and we will assume that the Gibbs measures 
we consider exist. We refer the interested reader to Ruelle (1969); Preston (1976); Bertin et al. 
(1999); Dereudre (2005); Dereudre et al. (2010), see also Section 6 for several examples. 

In this article, we focus on stationary marked point processes on $, i.e. on point processes 
admitting a conditional density with respect to the homogeneous marked Poisson process n. More- 
over, without loss of generality, the intensity of the Poisson process, z, is fixed to 1. We assume in 
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a first step that the family of energies is hereditary, which means that for any A G B(M ), for any 
tp G fl, and for all x m G A x M, 

V A (tp; 9)) = +cx) => V A (ip U {x m }; 9)) = +oc, (2) 

or equivalently, for all x m G tp A , /a(¥>a|¥>a c ; #) > => /a&A n {a; m }|v?A c ; > 0. The non- 
hereditary case will be considered in Section 7. The main assumption is then the following. 

[Mod-E]: For any 9 G ©, the compatible family of energies (Va(.; 9))AeB(R d ) 1S hereditary, 
invariant by translation, and such that an associated Gibbs measure Pg exists and is sta- 
tionary. Our data consist in the realization of a point process with Gibbs measure Pg* . The 
vector 9* is thus the true parameter to be estimated, assumed to be in 0. 

The local energy to insert a marked point x m into the configuration tp is defined for any A 
containing x m by 

V(x m \tp;6) :=V A (tpU{x m })-V A (tp). 

From the compatibility of the family of energies, i.e. (1), this definition does not depend on A. We 
restrict our study to finite-range interaction point processes, which is the main limitation of this 
paper. 

[Mod-L] : There exists D > such that for all (m, tp) G M x fi 

V(0 m \tp;6)=V(0 m \tp mD y,9). 

2.3 Definitions of residuals for spatial point processes 

The basic ingredient for the definition of residuals is the so-called GNZ formula stated below. 

Theorem 1 (Georgii-Nguyen-Zessin Formula). Under [Mod-E], for any function h(-, •;#):$ X 
fi — > R (eventually depending on some parameter 9) such that the following quantities are finite, 
then 

E ( [ h (x m , $; 9) e- v ^ m ^' 8 *^(dx m )] = E ( V h (x m , $ \ x m ; 9) ) , (3) 

where E denotes the expectation with respect to Pg* . 

For stationary marked Gibbs point processes, (3) reduces to 

E (h (0 M , <f>; 9) e -v(o M \*-r)\ = E ( h ( Q A/ ; $ ^ q m. (4) 



where M denotes a random variable with probability distribution A m . The following definition is 
based on empirical versions of both terms appearing in (4). 

Definition 2. For any bounded domain A, let us define the h— innovations (denoted by I A ) and 
the h— residuals (denoted by R A and depending on an estimate 9 of 9* ) by 

I A {tp;h,9*) := f h(x m ,tp;6*)e- v ^ m ^' e ^n{dx m )- h (x m , tp \ x m ; 6*) 

R A (tp; h, 9) := 



f h(x m ,tp ] e)e- v ( xm ^ ;S ) i i{dx m )- V h(x m ,ip\x 
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From a practical point of view, the last notion is the most interesting since it provides a 
computable measure. The main examples considered by Baddcley et al. in Baddeley ct al. (2005) 
(in the context of stationary point processes) are obtained by setting ft (x m , ip; 9) = 1 for the raw 
residuals, h(x m ,<p;9) = e v r ( x 'l^- 6 ' for the inverse residuals and ft (x m , (/?; £>) = e^™^- 6 '/ 2 for the 
Pearson residuals. In particular, one may note that the raw residuals constitutes a difference of 
two estimates of the intensity of the point process (up to a normalisation by |A|): the first one is 
a parametric one and depends on the model while the second one is a nonparametric one (since it 
is equal to |va|)- Another more evolved example is to consider the function defined for r > by 

h r {x m ,^e) := l [0 , r] (d(x m ,<p)) e v ^ 1 ^ 

where d{x m , ip) = vai v m^ v \\y — x\\. Considering this function leads to 

R A (<p;h r ,6)= f l [a . r] (d(x m ,^))fi(dx m )- V h r (x m ,p\x m ;9). (5) 

Then for a large window R((p;hh,0)/\A\ leads to a difference of two estimates of the well-known 
empty space function F at distance r. Recall that for a marked stationary point process (see 
M0ller and Waagcpetersen (2003) for instance) 

F(r) := P(d(0 M ,$) < r) . 

The first term in the right hand side of (5) corresponds to the natural nonparametric estimator of 
F(r) while the second one is a parametric estimator of F(r). 



3 Asymptotic properties 

From now on, we assume that the point process satisfies [Mod-E] and [Mod-L], that is [Mod]. 
The realization of $ ~ Pg* is assumed to be observed in a domain A„ © D + , with D + > D, aimed 
at growing up to R d as n — > +oo. According to the locality assumption [Mod-L], we are thus 
ensured that the ft,— innovations and ft— residuals can be computed. 

The aim of this section is to present several asymptotic properties for I\ n and R\ n . We prove 
their consistency and we propose two asymptotic normality results within different frameworks: 

• Framework 1: for a fixed test function ft, A„ is a cube, divided into a fixed finite number 
of sub-cubes (which will increase with A„). The purpose is then to obtain the asymptotic 
normality for the vector composed of the ft— residuals computed in each sub-cube. 

• Framework 2: we consider hi, . . . , h s s different test functions and the aim is to obtain the 
asymptotic normality of the vector composed of the hj— residuals computed on A n . 

In both frameworks, an estimate of 9* is involved. We assume that it is computed from the 
full domain A„ with the same data used to evaluate the ft— residuals, which is a natural setting 
in practice. Moreover, contrary to the previous works dealing with asymptotic properties on 
Gibbs point processes (e.g. Jensen and Ktinsch (1994), Comets and Janzura (1998) or Billiot et al. 
(2008)), where A„ is assumed to be of the discrete form [—n, n] d , we consider general domains that 
may grow continuously up to M. d . 

The asymptotic results obtained in this section are the basis to derive goodness-of-fit tests, as 
presented in Section 5. 

3.1 Consistency of the residuals process 

We obtain consistency results for 1^ ($; ft, 9*) and Rjy ft, 9 n ($)j , where for all n > 1, A„ C 

An, (A n ) n >i and (A„)„>i are regular sequences whose size increases to oo. 
The assumption [C] gathers the two following assumptions: 
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[CI] 



E 



(|/i(0 M ,$ ; r)|e- y (° M l*^)) 



< +00. 



[C2] For all (m, ip) € M x fl, the functions h (0 m , <p; 9) and / (O m ,ip; 9) := h (0 m , ip; 9) e -V(o m \<p-,e) 
are continuously diffcrentiable with respect to 9 in a neighborhood V(9*) of 0* and 



E 



f (i) f M ,$ : r 



< +00 and E 



h (i) ( A/ ,$ ; r 



-l/(o M |4>;0* 



< +OO, 



where denotes the gradient vector of / with respect to 9. 
Concerning the residuals process, we also need to assume 

[El] The estimator 9 n (p) of 6* , computed from the full observation domain A„, converges for 
Pg»— a.e. ip towards 9*, as n — > +00. 

Proposition 2. Assuming [Mod], we have as n — > +00 

(a) Under [CI]: for Pe* — a. e. ip, |A„| _1 /^ {^'^,9*) converges towards 0. 

(b) Under [C] and [El]: for Pg*~a.e. tp, \A n \~ 1 R^ (^p; h, 9 n (tp)j converges towards 0. 

Remark 1. Assumption [Mod-L], while useful to allow the computation of the residuals in prac- 
tice, is actually useless to prove their consistency. 

3.2 Asymptotic control in probability of the residuals process 

We provide in this section a control for the departure of the residuals from the innovations and 
(9 n — 9*). This is a crucial result to investigate the asymptotic normality of the residuals. We need 
the folllowing assumptions. 

[Nl] For all (to, <p) € M x ft, the functions h (0"\ <p; 9) and / (0"\ ip; 9) (defined in [CI]) are twice 
continuously differentiable with respect to 9 in a neighborhood V(0*) of 9* and 



E 



f (2) (p M ^-e*) 



where g (2) (0 m ,p;9*) 



< +00 and E 



(||h( 2 ) (0 M ,<M*)|e- y ( oM| * ; <>) <+oo, 
for g = f,h. 



i<j,k<p 



[E2] There exists a random vector T such that the following convergence holds as n — > +00 

\K\ 1/2 (*„(*) -0*) At. 

Proposition 3. Under assumptions [C], [Nl] and [El-2], assuming that \A n \ = 0(\A n \), then 



R Kn ($; h, ?„($)) = I Kn ($; h, 9*) - |A n | - 9^ £ (h; 9*) + opQA^ 2 ), (6) 

where £ (h] 9*) is the vector defined by 

£ (h; 9*) := E (h (0 M , $; 0*) V« (0 M |$; 0*) e -^(° M l*;**)) . (7) 

The notation X n ($) = op(w n ) means that w^ 1 X n (&) converges in probability towards as n 
tends to infinity. 

Remark 2. Note that for exponential family models, V^ 1 ) (x m \p; 9*) corresponds to the vector of 
sufficient statistics (see Section 6 for more details). 
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3.3 Assumptions required for the asymptotic normality results 

Apart from the assumptions [Mod] , [C] and [Nl] on the model, we will need to assume [N2-4] 
below. All these assumptions are fulfilled by many models as proved in Section 6. 

[N2] For any bounded domain A, for any 9 £ V(6*), 

E(J/ A ($;M*)| 3 ) <+^- 
[N3] For any sequence of bounded domains T n such that r„ — > when n — s- oo, for any 9 £ V(9*), 

E (l T „ (<I>;M) 2 ) — > 0. 

[N4] For any </?£$! and any bounded domain A, I\ (ip; 9) depends only on </?a©d. 

Concerning the properties required for the estimator 9 n , we need its consistency through [El] 
and to refine [E2] into [E2(bis)] below. Note that the maximum pseudolikelihood estimator 
satisfies these assumptions for many models (see section 6.2). 

[E2(bis)] The estimate admits the following expansion 

- 0* = -^U A „ ($; 8*) + 0p (|A n r 1 /2) 7 

where, for any 9 £ V(8*), 

(i) for any ip £ Q and for two disjoint bounded domains Ai, A2, 

U AiU a 2 (<P, 0) = U Al (<p; 9) + Ua 2 fa 9) , 

(ii) for all j = 1, . . . ,p and any bounded domain A 

E^Ua^^.Q <+ w , 

(hi) for all j = 1, . . . ,p and for any bounded domain A 

e((U a ($;% 

(iv) for all j = 1, . . . ,p and for any sequence of bounded domains r n , 

E ((U r „ 9)f^j — > as T n 0, 

(v) for any tp £ 51 and any bounded domain A, Ua {f, 9) depends only on </?a©d- 

Remark 3. Assumption [E2(bis)J implies [E2J. Indeed, under this assumption one may ap- 
ply Theorem 2.1 of Jensen and Kiinsch (1994) an d assert: there exists a matrix S such that 

|A„|- 1 / 2 U A „ (*;0*)4.Af(O > E), as +00. 



$a= =0, 
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3.4 Asymptotic normality of the h— residuals computed on subdomains 

of A n 

In this framework, we give ourself a test function h and we compute the h— residuals on disjoint 
subdomains of A„. In this context, we assume that the domain A n is a cube and is divided into a 
fixed number of subdomains as follows 

A„ := |J A j>n 

where J is a finite set and all the Aj jTl are disjoint cubes with the same volume |Ao jra | increasing up 
to +00. Let us denote by Rj. n ( <p; h, 9 n ) the vector of the residuals computed on each subdomain, 



i.e. B.j >n (^p;h,8 n J = {Ra^ (p;h,8 n ^ 



According to Proposition 3 and in view of [E2(bis)], we introduce the following notation 

RooAv, h, 9) := I K { V ; h, 9) - U A { V ; B) T £ (h; 9) (8) 
for any tp € f2, for any bounded domain A and for any 9 £ &. 
Proposition 4. Assume that 

• The parametric model satisfies [Mod]. 

• The energy function and the test function h satisfy [CJ and [Nl-4]. 

• The energy function and the estimate 9 n satisfy [El] and [E2(bis)J. 
Then, the following convergence in distribution holds, as n — > +00 

\A 0>n \- 1/2 Rj, n ($;hA) N (0,^(9*)) , (9) 

where = A/ mi Lji + |i/| _1 (Ai{ es — A/ nfl ) J with J = ee 7 and e = (1, . . . , 1) T . The constants 

A/nn ond \R es are respectively defined by 

Xinn = D- d Y, E Ca (d) h, 9*) I Ak[D) ($; h, 6*)) , (10) 

\k\<l 

\Res = D~ d Y ^{Roo,A (D) (®;h, ^RvoAhW (&,h, 0*)), (11) 
lfcl<l 

where, for all k £ Z d , Ak(D) is the cube centered at kD with side-length D. 

From this asymptotic normality result, we can deduce the convergence for the norm of the 
centered residuals. This is the basis for a generalization of the quadrat counting test discussed in 
Section 5. We denote by H.j tn (<p; h) the mean residuals over all subdomains, that is Rj- jra (y; h) = 

Ul^E^jR^n (v;M»). 

Corollary 5. Under the assumptions of Proposition 4, 

|Ao,n| -1 ||R,7,,i($; h) — R i 7 i „($; h)\\ 2 -A \ Inn X f Jhv (12) 

Proof. An easy computation shows that Aj„„ and Xr C s are the two eigenvalues of 5^(0*) 
with respective order \J~\ — 1 and 1. Let P/ ml be the matrix of orthonormalized eigenvectors 
associated to \j nn . This matrix of size (\J\, \ J\ — 1) satisfies by definition Pj- nn Pj nn = 

and, from (9), IAq^I" 1 \\^ Inn Rj, n (ip;h)f A A inn X\j\-y Moreover, it is easy to check that 

P/„nELn = - \J\- X l\j\ which leads to \\& nn R.j, n {<p;h)\\* = \\Rj, n (ip; h) - Rj, n (<p] h)\\ 2 . 



9 



Remark 4. The asymptotic covariance matrix 11^6*) and Xi nn involve only the covariance struc- 
ture of the innovations (or the residuals) in a finite box around 0. This comes from the locality 
assumption [Mod-L], also involved in [N4] and [E2(bis)[. A challenging task in practice is to 
estimate \i nn and XR es (and so S 1 (^*)J, this issue is investigated in Section 4- 



3.5 Asymptotic normality of the (hj)j—i t .„ tS — residuals computed on A n 

In this framework, we consider s different test functions and we compute all hj— residuals on the 
same domain A„, which is assumed to be a cube growing up to R d when n — > +oo. 
Wc present an asymptotic normality result for the random vector (^Ra„ ($] hj, ^r^jj 

Proposition 6. Assume that 

• The parametric model satisfies [Mod]. 

• The energy function and the test functions hj (for j = l,...,s) satisfy [CJ and [Nl-4]- 

• The energy function and the estimate 6 n satisfy [El] and [E2(bis)] . 
Then, the following convergence in distribution holds, as n — > +oo 

|A„r 1/2 (i? A „ (*;>»,• a)) s A^(o,s 2 (r)), (13) 

where S 2 (#*) is the (s, s) matrix given by 

X 2 (6*)=D- d E(R 00 , Ao(D) ($;h,r)R 00 , Afc(D) ($;h,r) T ) , (14) 
|fe|<i 

where R 00 .a(¥', h, 6*) := (Roo,A(f, hj, 6**)) J=1 s , see (8), and where, for all k 6 7L d , Ak(D) is the 
cube centered at kD with side-length D. 



4 Estimation and positivity of the asymptotic covariance 
matrices 

4.1 Statement of the problem 

The aim of this section is to provide a condition under which, on the one hand the matrices S 1 (0*) 
and S 2 (#*), defined in Propositions 4 and 6, are positive-definite, and on the other hand A/ Im , 
involved in Corollary 5, is positive. Then we define estimators of S^ 1/2 ((?*), and S^ 172 ^*). 
As a consequence, we will be in position to normalize and estimate the quantities arising in (9), 
(12) and (13) so that they converge to a free law. 

Before this, let us focus on the particular form of the matrix S 1 (6'*). This (\J\, \ J\) matrix has 
two eigenvalues A/„„ and \u es (respectively denned by (10) and (11)), whose multiplicity is \ J\ — 1 
for Xinn and 1 for \R es . By using the Gram-Schmidt process for orthonormalizing the eigenvectors 
of S 1 (0*), one obtains the explicit form for the squared inverse of this matrix, provided Xinn and 
Xr £S do not vanish: 



y/Xlnn V V Xr cs y/Xl r , 

where J = ee 7 " and e = (1,...,1) T . Therefore, estimating S^ 172 (0*) can be reduced to the 
estimation of these two eigenvalues Xi„ n and Xr £S . 
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Consequently, the estimation of Xi nn and the covariance matrices 5^ (#*) and S 2 (8*) is achieved 
by estimating (10), (11) and (14), which can be viewed as a particular case of estimating the matrix 
(actually a constant for the two first expressions) 

M(8* ) =D- d Y,V (Ya o( d) (*; 9*) Y Ak(D) ($; e*f) , 

fcl<l 

where, according to the assumptions involved in Propositions 4 and 6, for any bounded domain A, 
Ya($; 8) is a random vector of dimension q (q = 1 or s) depending on 8, such that for any bounded 
domains A, Ai, A2 (Ai, A2 disjoint), for any 8 € V(8*), for any j = 1, , . , , q and any <p € CI 

(i) Y AlUA2 ( V ; 8) = Y Al (p; 8) + Y A2 8), 

(ii) e((Y a (<M))-) <+^, 

(hi) e((Y a (*;0)),-|*a«) =0, 

(iv) for any sequence of bounded domains T n , E ^(Yr„ ($; — > as r„ — > 0, 

(v) Y\((p;8) depends only on ip\®D- 

4.2 Positive definiteness of M(0*) 
Let us consider the following assumption. 

[PD] For some A := U^^rD-j Aj(5) with 8 > 0, there exists B <G T and Aq, . . . , A^, (£ > 1) disjoint 
events of fis := |<p € : S B, 1 < |i| < 2 £ | such that 

- forj = J ...,4iV(A,-)>0- 

- for all (</3o,--->w) € A x • • • x the (£, g) matrix with entries (Yj (^; 6**))^ — 
(Y^- (<po\ is injective, which means: 

(Vy e R q y (Y T to;e*) - Yjfa,;**)) = 0) y = 0. 

Proposition 7. From i/ie definition of Ya($;#) and under [PD], the matrix M(6>*) is positive- 
definite. 

Remark 5. TTie assumption [PD] is associated to some characteristics of the point process <&. 
TTie parameter 8 is independent of the parameters involved in the different estimators (e.g. D v or 
8 arising in the next section). Given a model, the event B and 8 are chosen in order to let the 
different configurations sets Aq, Ai, . . . , Ai as simple as possible. For most models, a convenient 
choice is B — and 8 > D (see the examples treated in Appendix B for instance). 

4.3 Estimation of M(0*) 

The dependence of M(0*) on D may be lightened thanks to the following lemma, whose proof is 
relegated to section 8.5. 

Lemma 8. The matrix M(#*) can be rewritten for any 8 > and any D v > D as 

M(6*) = S- d J2 E ( Y AoW 0*) Y Afc ( 5) e*f 

where Ak(8) is the cube with side-length 8 centered at kS. 
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From this result, to achieve an estimation of M(0*), it is required to estimate the involved 
expectation and 9* (by 9 n ). This is enough for the estimation of \i nn for which Ya(^; 0) ~ Ia{<P\ 9). 
But when Y\(ip;9) = Rco,A{<P',h,6) or Y\(ip;0) = R 00 .a(<p; h, 6), which appears in J± 1 (9*) and 
S 2 (0*), it can be noticed that Ya still depends on an expectation with respect to Pg* , through 
the vector £(h,9*) defined by (7). Moreover, the vector Ua in [E2(bis)] may also depend on such 
a term (this is the case for example when considering the maximum pseudolikelihood estimate as 
shown in Section 6.2). This means that Ya(</?;#) cannot be estimated only by Y\((p;9 n )^ but by 
Y ny \((p; 9 n ), where Y„ is an estimator of Y. We assume in the sequel that Y„ satisfies the same 
properties (i) — (v) as Y and is a good estimator of Y (see Proposition 9). The explicit form of Y„ 
depends strongly on the estimate 9 n (e.g. through Ua in [E2(bis)]). When 9 n is the maximum 
pseudolikelihood estimator, we provide explicit formulas for Y„ in Section 6.3. 

Let us now specify an estimator of M(#*). Assume that the point process is observed in the 
domain A„ © D + where D + > D and A„ is a cube. For any 8 such that |A no |<5~ d € N, we 
may consider the decomposition A„ = UkeK n ^-k{8), where the Afc(5)'s are disjoint cubes with 
side-length S and centered, without loss of generality, at kd. For any such 8, according to Lemma 8, 
a natural estimator of M(#*) is, for any D v > D, 



M no feen„M,^ v ) = lAj- 1 ^ J2 y„ 0iA . (S) (yiMvOJ Y„ 0jAfe(5) (ip-XM) ■ 

(15) 

Remark 6. As suggested by Lemma 8, the parameter 8 in (15) may be chosen arbitrarily. Yet, 
while M(fl*) is actually independent of 5, its estimate M„ a may depend on it due to edge effects. 

The following proposition provides a framework to study the asymptotic properties of (15) and 
shows the consistency of M ro when the domain A„ increases up to oo as n — > oo. Its proof is 
relegated to section 8.7. 

Proposition 9. Under [Mod], [El], assume that for any 9 in a neighborhood V(9*) of 9* , for 
any bounded domain A, for any (p G f2 and for j = 1, . . . ,p, (~Y n ^(ip; -J is a continuous function. 
Assume moreover that 



sup 



Y nAk (s n )(^0)-Y Ak{Sn) (^,e) 4 0, (16) 



where, for any S > as above, (5n)n6N is a sequence satisfying \K n \8 n d € N, 8 ng = 8 and 8 n 
as n — > oo. Then, for any D v > D, 



Remark 7. The choice of the sequence (5 n )neN is always possible (see the proof). Since we allow 
the domain A„ to grow continuously up to R d , its decomposition in sub-cubes with side-length 8 
is not always possible. The sequence (<S n ) n gN is thus mandatory to make a decomposition of the 
domain available when n increases. We chose it by respecting as most as possible the initial choice 
of the practicioner. 



5 Goodness-of-fit tests for stationary marked Gibbs point 
processes 

We present in this section three goodness-of-fit tests, based on the residuals computed according 
to the different frameworks considered in Section 3. We assume that the point process is observed 
in the domain A nn D + where D + > D and A„„ is a cube. 
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5.1 Quadrat-type test with | J| — 1 degrees of freedom 

According to the setting detailed in Section 3.4, we divide the domain A„ into a fixed number 
of subdomains, namely A„ := Uj e j A-j,n where J is a finite set and all the Aj. no are disjoint 
cubes with the same volume |Ao irio |. Moreover, in each sub-domain, we consider the decomposition 
Aj,n = UfcgKj Ak(S), for any 6 such that |Ao jno |<5~ <i € N, where the Afc(<5)'s are disjoint cubes 
with side-length 6. 

Following (15), we consider, for any 6 > as above and any D v > D, the estimator 

(*>)) , (17) 

where /C„ = U je j]Cj, no . Note that Ia«(<5) (fi ^no(f)) = #A«(<5) (<P;0no (<£>)) but we preserve this 
redundant notation in the sequel. 

The following corollary is an immediate consequence of Corollary 5 and Proposition 9. 

Corollary 10. Under the assumptions of Proposition 4 and if [PD] holds for Y-r (<&;#*) = 
i^- (<£>;#*), then, for any S > 0, one can construct a sequence (#n)neN satisfying \Ao }Tl \8~ d € N, 
S na = S and S n — ¥ S, such that as n — > +oo 

T ltn := lAo.nl" 1 A~ j n „ x ||R Jin ($; ft) - Rj-,n($; ft)|| 2 -A X 2 (|^| - 1). (18) 

This result leads to a goodness-of-fit test for Ho : $ ~ p B „ versus H\ : $ Pg. • Let us briefly 
summarize the different steps to implement the test for a given asymptotic level a € (0, 1). 

• Step 1 Consider a parametric model of a stationary marked Gibbs point process with finite 
range D observed on the domain A„ © D + with D + > D. 

• Step 2 Choose an estimation method satisfying the assumptions [El] , [E2(bis)] (for example 
the MPLE) and compute the estimate 9 no on A„ . 

• Step 3 

a) Consider a test function h (satisfying [Cl-2], [Nl-3] and [PD]), divide A„ into \J\ 
cubes and compute the ft,— residuals on each different cube. 

b) Estimate Xi nn by (17). 

c) Compute the test statistic Ti,„ involved in (18). 

• Step 4 Reject the model if T hno (<p) > Xi- a (\J\ - !)• 

Let us note that in the particular case of a homogeneous Poisson point process with intensity 
z and when considering the raw residuals (h = 1), this test is exactly the Poisson dispersion test 
applied to the \J\ quadrat counts, also called quadrat counting test, see Digglc (2003) for instance. 
Indeed, in this case, Rj^ n (ip; h) — Rj, n ((p;h) is the vector of quadrat counts and Xi nn = z. 
Considering \Ao, n \X n0t i nn as an estimation of the intensity on Ao, n , the statistic Ti >n reduces to 
the ratio of the sum of squares of the quadrat counts over their estimated mean. 

Remark 8. The condition [PD] in Corollary 10 has to be verified with Y-r- ($;#*) = 7^- ($;#*) 
which is not so difficult (see Proposition 16 for a general result). Indeed, contrarily to Corollary 11 
and 12, this condition does not depend on the form of the estimator 9 n . Moreover, as emphasized 
in Section 3.4, the assumptions of Proposition 4 are satisfied for many models (this will be explored 
in details for exponential models in Section 6.1). This means that the proposed goodness-of-fit test 
based on (18) may be used for many models and many choices of function h. 
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Remark 9. The weakeness of this testing procedure (and the next ones) could be the estimation 
(17) of Xinn (and in general the estimator (15)). The choice of the parameters 6 and Z? v in (17) 
is crucial. For instance, for fixed n, in the extreme cases S — > or _D V — > oo, we get X n j nn ~ 0. A 
careful simulation study should help for these choices. Another improvement could be to estimate 
A/nn "via Monte- Carlo methods. 

5.2 Quadrat-type test with \J\ degrees of freedom 

Under the same setting as above, assume moreover that [PD] holds for Y^- (ip; 9*) = R ^(Vi h,6*). 
Let us define the normalized residuals 

B-l.no (Pi h ) : = Kl!lnn K J,n {<P\ h) + (x^Mes ~ Kojnn) ^J,n {V\ ty, 

where X no j nn is defined in (17) and A„ 0i _R es is an estimate of A_R es following (15). When consid- 
ering the MPLE, explicit formulas for X nQ: R es are given in Section 6.3. It is easy to check that 

~ -—-1/2 

Ri ! „ (( ( 9;/i) = S lra Rj ino (ip; h). Therefore the following corollary is deduced from Proposi- 
tions 4 and 9. 

Corollary 11. Under the assumptions of Propositions 4 o,nd 9, assuming that [PD] holds for 
Y^; ($;#*) = %($;#*) and Yj (<&;#*) = R x ^(<f>; h, 9*), then, for any 5 > 0, one can construct 
a sequence (<5 ra )neN which satisfies |Ao ; „|<5~ d £ N, <5 Il0 = 5 and 5 n — > S as n — > oo, such that as 
n — > +oo, 

?!,„($) := lAo^l^HRi.n^^)!! 2 A X 2 (\J\) (19) 

A goodness-of-fit test with asymptotic size a £ (0,1) is deduced similarly as in the previous 
section. The steps to follow in practice are the same except that in Step 3 b) , one has to estimate 
both Xinn and A_R es , and in Step 4 we reject the model if T^ no (ip) > Xi-a(l^l)- 

Remark 10. Let us emphasize that, with respect to Corollary 10, Corollary 11 involves an addi- 
tional more complex assumption: [PD] has to be satisfied for Y-^-(<£>; 9*) = R h, 9*). This 

kind of assumption deeply depends on the nature of the estimate 9. This problem is investigated 
in Proposition 18 for particular examples. Furthermore, we show in Proposition 17 that Xji es = 
occurs for many models and many choices of h including the Poisson model when h = 1 . These 
two remarks underline the fact that the test relying on Ti jM is more restrictive than the previous 
one with Ti, n . 

5.3 Empty space function type test 

Let us consider the setting of section 3.5, where s different residuals are computed on the same full 
domain A„ . We consider the decomposition A„ = Uk£K nQ Afc(<5), for any S such that |A„ Q \5~ d £ N, 
where the Afc((5)'s are disjoint cubes with side- length S. 

Under the notation of Proposition 6, assuming [PD] holds for Y^- (ip; 9*) = R^ -^(f, h, 9*), let 
us define 

R 2 , no ( W h, 9) := f£~ 1/a (R Ano h h 6)) 

V / J — 1 , . . . ,s 

- — --1/2 - — --1/2 ~ 

where S 2no := £ 2r[Q (ip,9;8,D v ) is an estimation of X 2 (#*) as in (15). See explicit formulas 
in Section 6.3 when considering the MPLE. 

From Propositions 6 and 9, we get the following corollary. 

Corollary 12. Assuming [PD] with Yt (ip; 9*) = R M j^P, h, 9*), under the assumptions of Propo- 
sitions 6 and 9, then, for any 6 > as above, one can construct a sequence (5 n ) n< £iq which satisfies 
|A n |<5„ £ N, S no = S and 8 n — > S as n — > oo, such that, as n — > +oo, 

T2,„($) := |A„| 1 /2||R 2ira ($ ;h ^)|| 2 A X 2 ( S ). (20) 



14 



A goodncss-of-fit test for Hq : $ ~ P 9 * versus H\ : $ *> Pg*, with asymptotic size a £ (0, 1) is 
deduced as before. From a practical point of view, the steps detailed in 5.1 are modified into: 

• Step 3 (framework 2) 

a) Consider s different test functions (satisfying [Cl-2], [Nl-3] and [PD]), and compute 
the s different hj— residuals on the same initial domain A„ . 

— -1/2 

b) Estimate the matrix S 2 (0*) by (15) and compute S 2ji with any numerical routine 
(e.g. a choleski decomposition or a singular value decomposition). 

c) Compute the test statistic T2 t n ( ( fi) defined by (20). 

• Step 4 Fix a £ (0, 1) and reject the model if T 2l no{ l p) > Xi- Q ( s )- 

6 Application to exponential models and the MPLE 

Through Sections 3, 4 and 5 three sets of assumptions have been considered. The first one deals 
with integrability and regularity of the model and the test function(s) and gathers [Mod], [C] and 
[Nl-4]. The second one is about the estimator 9 n and involves [El] and [E2(bis)]. Finally, the 
third one, assumption [PD] is very specific and deals with the positive definitcness of covariance 
matrices. We prove in this section that these assumptions are in general fulfilled for exponential 
family models and the MPLE. 

6.1 Assumptions [Mod], [C] and [Nl-4] for exponential family models 

The energy function of exponential family models is given for any A <G B(R d ) by Va(v;#) = 
(Fva(ip), where va(<^) is the vector of sufficient statistics given by va(^) = (vi^if), v p ,a(<p)) ■ 
The local energy is then expressed as V (x m \ip; 9) = (Fv (x m \(p), where v (x m \(p) = (vi(x m \ip), . . . , v p (x m \(pj) := 
va(^5 U {x m }) — v\(ip). Let us consider the following assumption: 

[Exp] For i — l,--- ,p, there exist n\ lnt ' , K^ 8up ' ) > 0, fc, £ N such that one of both following 
assumptions is satisfied for all (m, <p) £ M x SI: 

Oi > and - 4 ai) < « 4 (0"» = Vi{Q m \ mQ , D) ) < «i" up Vs(o,D)|* i . 

or 

-4 nf) < « 4 (0">) = «i(0 m |^( 0)D) ) < 4 SUP) - 

The assumption [Exp] has already been considered by Billiot ct al. (2008). It is fulfilled for a 
large class of examples including the overlap area point process, the multi-Strauss marked point 
process, the nearest-neighbor multi-Strauss marked point process, the Strauss type disc process, 
the Geyer's triplet point process, the area interaction point process,. . . . 

Proposition 13. Under [Exp], the assumptions [Mod], [C] and [Nl-4] are satisfied for the raw 
residuals, inverse residuals, Pearson residuals or residuals based on the empty space function. 

Proof. The assumption [Exp] implies that the local energy function is local and stable, which, 
from results of Bcrtin et al. (1999), implies that [Mod] is fulfilled. A direct consequence of [Exp] 
is that for every a > 0, for all 9 £ & and for alH = 1, . . . ,p 

E(M0 M |$)r e - eTv ( oA/ '*)) <+oo, (21) 

which ensures the integrability assumptions [C] and [Nl-2] for the residuals considered in the 
proposition. The locality assumption [N4] is contained in [Exp] . Finally, an application of the 
dominated convergence theorem, with the help of (21), shows [N3]. ■ 
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Remark 11. Our setting is not restricted to locally stable exponential family models. As an 
example, following ideas of Coeurjolly and Drouilhet (2009), one may prove that [CJ and [Nl-4] 
are fulfilled for Lennard- Jones type models. 



6.2 Assumptions [El] and [E2(bis)] for the MPLE 

Among the different parametric estimation methods available for spatial point patterns, the maxi- 
mum pseudolikclihood is of particular interest. Indeed, unlike the maximum likelihood estimation 
method, it does not require the computation of the partition function, it is quite easy to implement 
and asymptotic results are now well-known (see Jensen and M0llcr (1991), Jensen and Kunsch 
(1994), Billiot ct al. (2008), Dcrcudre and Lavancier (2009) and Coeurjolly and Drouilhet (2009)). 
The MPLE is obtained by maximizing the log-pseudolikelihood contrast, given for exponential 
models by 

LPL An (<p;6) = - f e-^ v( - xm ^fx(dx m ) - J]v(x">\x m ). (22) 

Proposition 14. Under assumption [Exp] (and an additional indentifiability condition), [El] 
and [E2(bis)] are fulfilled for the MPLE. The vector Ua(v; 0*) in [E2(bis)] is then expressed as 
follows 

U A ( W 6*) := H^r'LPL^; n (23) 
where liPlJ£\(p;0*) is the gradient vector of the log-pseudolikelihood given by 

LPL A ^ (cp; 9*) := f v(x m \<p; 9*)e- e " v ^ m ^ fx{dx m ) - V (- T "> \ 0*) (24) 

J AxM x'^e^A 

and where H(#*) is the symmetric matrix 

H(0*) := E (v(0 M |$; 0*)v(O M |$; 9*f e -^(o AJ l*;0*)) . ( 25 ) 

Proof. [El] is proved by Billiot ct al. (2008) (under [Exp] and the identifiability condition 
[Ident] arising p.244 in Billiot et al. (2008)). Let Z n (tp;9*) := -|A n |" 1 LPL An (¥'; 9*). If n {y) = 
0^ IPLE ((p) denotes the maximum pseudolikclihood estimate, one derives 

Z«^; 9 n ) - Z« (ip; 9*)=0~ Z« fo>; 9*) = H„( w 9\ 9 n ){9 n {ip) - 9*) 

with Il n ((p;0*,9 n ) = J *Z^ 2) (cp; 9* + t(9 n ((p) - 9*)j dt. Under assumptions [Exp] and [Ident], 

then, for n large enough, H„ is invertible and converges almost surely towards the matrix H(#*) 
given by (25). Moreover, following the proof of Theorem 2 of Billiot et al. (2008) (see condition 

(Hi) p.257-258), we derive Var(Z ( n ] ($; 9*)) = 0(|A„|- X ). So 

|A„| 1 / 2 ((?„($)-^)+H- 1 (r)zW($ ; r)) = -KlV^H-i^^^-H" 1 ^*)) z£>(*;0*) 

-> 0, 

in probability as n — > +oo. This implies (23). Finally, Ua(c^;#*) fulfills properties (i) — (v) in 
[E2(bis)] for the same reasons as in the proof of Proposition 13 and, for (Hi), from the proof of 
Theorem 2 (step 1, p. 257) in Billiot ct al. (2008). ■ 

Remark 12. In the same spirit as Remark 11, let us underline that the MPLE also satisfies [El] 
and [E2 (bis)] for some non locally stable and non exponential family models, including Lennard- 
Jones type models (provided a locality assumption). 



16 



6.3 Estimation of asymptotic covariance matrices when considering the 
MPLE 

We still focus on exponential family models. As in Section 4.3, we assume that the point process 
is observed in the domain A„ © D + where D + > D and A„ is a cube. Moreover, we consider 
the decomposition A„ = Ufc e je n Afc(<5), for any 8 such that |A no |(5~ d € N, where the Afc(<J)'s are 
disjoint cubes with side-length <5 and centered, without loss of generality, at kS. 

From (8) and (23), we have under the assumptions [Exp] and when considering the MPLE 

RooAv, h, 9*) := I A (tp; h, 9*) - LPI/%; 0*fw(h, 9*) (26) 

where W(/i, 9) := il(9)~ 1 £(h, 9). A natural estimator of W(/i, 9*) is given by W„ (iy9; h, 9„ ) := 
Hno(v; Q^^Znoiw h, 0„ o ) where 

H„>;0„ o ) = \ArJ- 1 [ v(x™\p)v(x m \<pfe- r ^ m M»(dx m ), 

Ja„ xM 

£ no (<p;h,9 no ) = lA^r 1 / h(x m ,<p;ljv(x m \<p)e~< v ( xm ^v(dx m ). 



In this spirit, let i?„ ,oo,A(<p; h,9 no ) := I A (ip; h,9 no ) - LPL\ (ip; 9 no ) W no (ip;h,9 no ) and 



Rno^, a( ( / ? ; h, 9„ ) := I Rn .oo.A( ( P't hj, 9 no ) ) . Based on these notation, we obtain the fol- 

V ' ' / j=l,...,s 

lowing estimations for A/ n „, \R es and E 2 (#*) 

K 0! i nn (cp,e no (cp),5,D v ) = |A„ | _1 Yl Ia *(<5) (^^(v)) Iaj(S) {y,^ no (cpfj , 

Ano.-ResCv, 6 na {ip),8, D V ) — |A„ Q | 1 ^ #oo,Ai(<5) (V5 ft, 0n o )Roo,Aj(S) ft, ^n )j 

(v;h, no )R OO) A i (5)(v;h, 0„ o ) . 

Corollary 15. Under the notation and assumptions of Propositions 4 cind 6, and under [Exp], 
then, for any 8 > as above, one can consider a sequence S n which satisfies S„ = 6 and S n — > 5, 
such that for any D y > D, the estimators X n j nn (^>, #„(<!>), 8 n , D y ), \ n ,Res(^j 9 n (^), S n , Z) v ) and 
S 9rl ($, #„($), 8 n , D v ) converge in probability (as n — > +oo) towards respectively Xi n n, Xn es and 
£ 2 (0*). 

Proof. We apply Proposition 9, where for any 9 £ V(0*), we set 

• for\ Inn : Y A (tp;0)=Y A (!p;6)=I A (tp;h,e). 

• for X Re8 : Y A (ip;9) = R^a^; h, 9) and Y niA (<p; 0) = Rn,oo,A(v; ft, 0). 

• for £ 2 (0*): Y A {<p- 9) = RocaCv; h, 9) and Y n<A ((p; 9) = R„.oc,a(w h, 9). 

The result is obvious for A/ n „. For A# es (the proof is similar for S 2 (0*)), it remains to prove 
that for any 6 0, sup A . gK ; n fl„,oo,A fc (c5 n )($; ft, 0) - -Rn,oo,A fc (<5 n )($; ft, ^ in probability as 
n — > +oo. For any € /C n , we derive 

Rn,oc,A k ( Sn )($; ft, 0) - R^A k ( Sn )(^ ft, ^) = LPL« (5n) (^; flf (w(/i, 0) - W„(^; ft, 0)) . 
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Assumption [E2(bis)] (implied by [Exp], sec Proposition 14) ensures that |LPL^ S >. Afe (^)(^; 0)\ 

converges to in quadratic mean. In particular, the convergence of LPL^m ^($',9) towards 

LPL^,«($; 6) holds in probability. Moreover under the assumptions [Nl] and [E2(bis)], the 

ergodic theorem of Nguyen and Zessin (1979b) may be applied to prove that W„($; h, 9) converges 
almost surely towards W(/i, 9), as n — > +oo. Slutsky's theorem ends the proof. H 

Remark 13. If the model is not an exponential model, Corollary 15 still holds by replacing the 
vector of the sufficient statistics, v(x m \tp), by the gradient vector of the local energy function, 
V^ 1 ) (x m \ip) in the different definitions. 

6.4 Positive definiteness of covariance matrices when considering the 
MPLE 

Let us now focus on the positive-definitess of the above quantities. According to Proposition 7 the 
key assumption to check is [PD] . 

As adressed in Remark 8, we begin by giving a general result ensuring that A/ Im > 0. 

Proposition 16. Under the assumption [Exp], then Xi nn > for the raw residuals, the Pearson 
residuals and the inverse residuals. 

Proof. In [PD], we fix 6 = D and B = 0. Let us write fl := £1$. Consider the following events 
for some n > 1 

A = {tp en : <p(A (5)) = 0} and A n = {<p e Q : (p(A {5)) = n} , 

and let <po G A and ^„ g i„. Recall that the local stability property (ensured by [Exp]) asserts 
that there exists K > such that V (x m \(p; 9*) > —K for any x m £ S and any ip € Q,. Now, let us 
consider the three type of residuals. 
Raw residuals (h = 1). From the local stability property 

li^; M*)-Ja(w>;WI >»- 

for n large enough. And so assuming that the left-hand-side is zero leads to a contradiction, which 
proves [PD]. 

Inverse residuals (h = e v ). Again, from the local stability property 

\I^((p n ;h,6*) ~ I T (ip ;h,9*)\ = 

which proves [PD] similarly to the previous case. 
Pearson residuals (h = e v l 2 ). From the same argument 

|%te„;M*)-lA(w);M*)l > 



> 

for n large enough, which ends the proof. H 

Proposition 16 asserts that [PD] is fullfilled for Y T ($;9*) = I^- ($;#*). Therefore, the com- 
bination of Propositions 13, 14 and 16 and Corollary 15 ensures all the conditions of Corollary 10 



,-V(a 



-V(x m \ Vo :0*) 



fi(da 



AxM 



> n-2\A\e K > 0, 



E 



V(x m \<p n \x m ;6*) 



> ne 



-K 



>o, 



E 



V(x m \ Vn \x m ;6*)/2 



x m £tp„ 



e -v{x™\ Vn -r )/2 _ e -v(x m \<Po;e*)/2^ dxm) 

AxM 

K ' 2 - 2\K\e K ' 2 > 0, 
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hold. So a goodncss-of-fit test based on (18) is valid for exponential family models satisfying [Exp] 
and for the raw residuals, the Pearson residuals and the inverse ones. 

Now, let us focus on tests based on Corollary 11 and 12. The following result is important 
from a practical point of view. It asserts that Xr C s (and so S 1 (0*)), and S. 2 ((9*) may fail to be 
positive-definite for an inappropriate choice of test function. 

Proposition 17. Let us consider an exponential family model, let 9 := Q MPLE and let us choose 
a test function of the form h(x m , <p\ 9) = ul r v(x m \(p) for some w £ l p \ 0, then Xfj es = and the 
matrices 11 1 (9*) and S 2 (#*) in Propositions 4 ond 6 are only semidefinite- positive matrices. 

Proof. The result is proved by noticing that 

H(0*)w = E(v(0 M |<i>)v(0 M |$)V v ( oM l*^)) u 
= E (v(0 M |$)(a7v(0 M |$))V v (° M l* i9 *)) 
= E (li(0 M , $; 0*)v(O M |$)e-^(° M l* ;r )) = 8 (uFv, 9*). 

Therefore, W(cJ r v, 0*) = Il(6*)~ 1 S(tJ r v, 9*) = u>, which means that for any tp £ f2 and any 
bounded domain A 

fl^A^ofv,*?*) = I h { V ;Jv,6*) - LPL^fa^fw = 0. 

This means that if, for the framework 1, the test function is of the form h = t/v then A^ es = and 
if one of the test functions, for the framework 2, is of the form h = uFv, then S 2 (6>*) is necessary 
singular. ■ 

Remark 14. As for Corollary 15, the result of Proposition 17 still holds in general by replacing 
the vector v(x m \<p) by the gradient vector of the local energy function ~V^(x m \ip). 

As a consequence of Proposition 17, the two goodness-of-fit tests based on T[ n and in 
Section 5.2 and 5.3 are not available (for the MPLE) if the test function h is a linear combination 
of the sufficient statistics v(x m \tp). Since for most classical models, the value 1 can be obtained 
from a linear combination of v(x m \tp), the raw residuals (h = 1) are not an appropriate choice 
for these two tests. This is the case for the two following examples: the area-interaction point 
process and the 2-type marked Strauss point process, which are presented in details in Appendix 
B. The following result proves that for a different choice of ft,— residuals, S 1 (0*) and S 2 (0*) are 
positive-definite. 

Proposition 18. For the 2-type marked Strauss point process and the area-interaction point pro- 
cess, when considering the MPLE as an estimator of 9* , then 

• the matrix 5^(0*) obtained in Framework 1 from the inverse residuals h = e v , 

• the matrix S 2 (#*) obtained in Framework 2 from the empty space residuals, which are con- 
structed for < ri < . . . < r s < +oo from the family of test functions 

h 3 {x m ,^ 9) = l [0ir]] (d(x m ,<p))e v ^> e \ j = 1, . . . , s, 

are positive-definite. 

The proof of this result is postponed in Appendix B. The combination of Propositions 13, 14, 18 
and Corollary 15 ensures all the conditions of Corollary 11 and 12 hold. So a goodncss-of-fit test 
based on (19) (resp. (20)) is valid for the 2-type marked Strauss point process and the area- 
interaction point process and for the inverse residuals (resp. the family of test functions based on 
the empty space function). 

Following the Proof of Proposition 18, it is the belief of the authors that such a result holds for 
other models and other choices of test functions. However, another model and/or test functions 
will lead to a specific proof. Therefore, this result cannot be as general as the one presented in 
Proposition 16. 
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7 The non-hereditary case 



Up to here, we have assumed through [Mod-E] that the family of energies is hereditary. We 
consider in this section the non-hereditary case. This particular situation can only occur in presence 
of a hardcore interaction. From a general point of view, we say that a family of energies involves 
a hardcore interaction if some point configurations have an infinite energy. Many classical models 
of Gibbs measures include a hardcore part, as the hard ball model. 

A family of energies involving a hardcore part is hereditary if (2) holds. This is a common 
assumption done for Gibbs energies and it appears to be fulfilled in most classical models, including 
the hard ball model. However, one may encounter some non-hereditary models, in the sense that 
(2) does not hold. Intuitively, in this case, when one removes a point from an allowed point 
configuration, it is possible to obtain a forbidden point configuration. This occurs for instance for 
Gibbs Delaunay-Vorono'i tessellations or forced-clustering processes (see Dcreudre and Lavancier 
(2009) and Dcreudre and Lavancier (2010)). 

In the non- hereditary case, the GNZ formula (3), which is the basis to define the residuals, 
becomes false (see Remark 2 in Dereudre and Lavancier (2009)). It is extended to non- hereditary 
interactions in Dcreudre and Lavancier (2009). This generalization requires to introduce the notion 
of removable points. 

Definition 3. Let cp £ and x € ip, then x is removable from tp if there exists A £ B(R d ) such 
that ieA and V\(tp — x; 9) < oo. The set of removable points in (p is denoted by lZ(cp). 

Notice that in the hereditary case, TZ(ip) = (p. 

The GNZ formula is then generalized to the non-hereditary case as follows. Assuming for 
any 9 G that a Gibbs measure exists for the family of energies (Va(-; #))Ae8(R d )j then, for any 
function h(-, ■, 9) : $ x Q, — > M. 2 such that the following quantities are finite, 

e( [ h{x m ,<S>-9)e~ v{xm ^ r) n{dx m )) =E f V h (x m , $ \ x m ; 9) \ . (27) 

V-VxM / \ xmenm J 

We can therefore define the h— residuals for (possibly) non-hereditary interactions. For any 
bounded domain A, if 9 is an estimate of 9* , the h— residuals are 

R A (y; h, t\ = I h^x m ,ip;9^e- v ( xm ^)^(dx m )- J2 h (x m , p \ x m -,9^. (28) 

X m <£Tl(<pA) 

If the set of removable points lZ((p) does not depend on 9, it is straightforward to extend all 
the asymptotic results obtained for the residuals in the preceding sections to (28). 

If the set of removable points depends on 9, this is false. Even in the hereditary case, if 9 is a 
hardcore parameter (as the hardcore distance in the hard ball model) then 9 behaves as an estimator 
of the support of the distribution P§. In this case assumption [E2] has typically few chances to 
hold and the asymptotic law of the residuals is unknown. In Dcreudre and Lavancier (2010) Figure 
15, some simulations of raw-residuals for Gibbs Vorono'f tessellations are presented, involving an 
estimated hardcore parameter in a non-hereditary setting : they show that the distribution of the 
residuals does not seem to be gaussian in this case. 



8 Proofs 

Since any stationary Gibbs measure can be represented as a mixture of ergodic measures (see 
Preston (1976)), it is sufficient to prove the different convergences involved in this paper for ergodic 
measures. We therefore assume from now on that Pg* is ergodic. 
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8.1 Proof of Proposition 2 

(a) Under [CI], the ergodic theorem of Nguyen and Zcssin (1979b) holds for both terms appearing 
in the definition of 7 A (p; h, 6*). Then, as n — > +oo, one has Pg*— a.s. 

|A n |- 1 / An ($; h, 9*) ->■ E (h (0 M , $; 0*) e-vKW*)) - E (h (0 A/ , $ \ M ; 0*)) , 

which equals to from the GNZ formula (4). 

(b) The aim is to prove that the difference \A n \~ 1 R^ up; h, O n (ip)j — |A n | -1 /^ (ip; h, 0*) converges 
towards for Pg* — a.e. ip. Let us write 

R kn (<p;h,O n (<p)) -I- An ($;h,6*) := T x (ip) - T 2 (ip) 

with 



Ti(<p) := / (f(x m ,ip- ) e n (ip))-f(x m , ( p;e*))fi(dx m ) (29) 

JA„xM V V 7 7 

Tafoj) := J] ft(a m > v\!C m ;to)-A(a:' n , V \ a: ' n ;ff*). (30) 



Under the Assumptions [C2] and [El], from the ergodic theorem and the GNZ formula, there 
exists hq G N such that for all n > no 

IKl- 1 ^) < 3- f (^M-r)V)(x m ,^r)M^ m ) 

|A„| Ja„xm v 7 

< 2\\9 n (p)-6*\\x-J- [ \\fW(x m ,<p;6*)Mdx m ) 

|A„| JA„xM 

< A\\9 n (ip)-e*\\xB(\\f^(0 M ,^e*)\\), (31) 

and 

|A„|- 1 T 2 ( ¥ p) < 3- (0n(<p)-e*) T hW(x m ,<p\x m ;6*) 

I n| x m £ip^ 

< i\\9 n ((p) - 0*\\ x E(||h« (O M ,$;0*) || e - v (° M| * i9 *)) . (32) 
Equations (31) and (32) lead to 

ikr'R^ (<p;hX(<pj) - |A«r^A„ toM*) < c\\9 n (<p) - 9*\\, 
for n large enough, with c = 4 x E (||fW (0 M ,$;6»*) || + || h.< x > (0 M , $;(?*) || e - y (° M l <I>;< '*) 



8.2 Proof of Proposition 3 

Recall that 



R kn (ip- h, 6 n (<p)) - I kn (ip; h, 9*) = T x (ip) - T 2 (p) 



where T\(ip) and T 2 (ip) are defined by (29) and (30). Let us write 

[e n {ip)-e^ 

'A„ xM 



Ti(ip) = f (e n (<p)-e*) T fW(x m ,ip;e*)»(dx m ) + T{(ip) 
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with 



A 1 lx m ,Lp;6 n (ip) )ti{dx m ) 

A„xM V ' 

]T A 2 (x m ,<p\x m ;6 n (v)] 



and 



M (x m ,cp;9 n (<pf) := f (x m , ip;0 n ) ~ f (x m , <p; 9*) - (o n (<p) - 9*ff^ ( x ™, ^ 9*) 



A- 



From the mean value theorem, there exist for j = 1, . . . ,p, 

G [min(0i,#*),max(0i,0*)] x ... x [min(# p , 9*), max(# p , 9*)} such that 



A! 



(x m ,ip;9 n (<p)) = ^{9 j -6*)[ff ) {x m ,^,i x , j )-ff ) {x m ,^9*)) 



(33) 



3 = 1 



A 



(x m ,<p \ x m -9 n {v)) = Yfij - 0*) (hV {x m lV \ x m ;&j) - hf> (x m , <p \ x m ; 9*)) . (34) 



Let j G {1, . . . ,p}, again from the mean value theorem, there exist for £ = 1, 2 and for k = 1, . . . ,p, 
Vej.k G [v3ia(Ct,j,i,0i),max(^e,j t x,0^)] x ... x [minf&^p, &*), max(& t j lP , 9*)] such that 

ff\x m ^ 1 , j )-ff ) {x m ,i P ;9*) = X)(6j,fc-^)/ifc ) (^ m ,Whj,fc) (35) 
ftf (x"> \ x m ; & .,-) - fcf (x m , ^ \ x m ; r ) = ^ (6,,,. - 0£) ftg> (* m , ^ \ x m - m,j lk )(36) 



fc=l 



By combining (33), (34), (35) and (36) and under [Nl], we can deduce the existence of no G N 
such that for all n > no, one has for Pg*— a.e. ip 

lA^- 1 ^)! < J-r f El^-^-^/ffc^W^) M^ m ) 
|A n | Ja^m^ 1 



< 2||0„( ¥ >)-0*|| 2 x 



1 



< 4||0 n fo>) 



3* II 2 



|A„| JA„xM 



E(||f (2) (0 M ,$;9*)\ 



||f( 2 > (x m , w r)||/i(^ m ) 



and 



lAni- 1 !^'^)! < j2 Y<\fo- 6 Vfa- e i) h tkHx m ,v\x m -,o*) 

|A„| a-mg^ 

< 2||a„M-r!| 2 x^L £ HhW^, ¥>\x ro ;0*)|| 

|A n | x m. Eif>An 



< 



4\\0 n (<p) - 9*\\ 2 x E (||h( 2 > (0 M , $; 0*) He-^Kl*^)) 



Since 



1/2 



\K\ 1/2 \\9 n (<p) &*f = I j^l j || |A n |V2(e n( ^ _ x ||0 nM _ * || 
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then, under the assumptions of Proposition 3, one has, from Slustsky's theorem, the following 
convergence in probability as n — > +00 

\A n \V 2 \\6 n (<Z>)-6*\\ 2 ^+0. 
By combining all these results, one obtains the following convergence in probability, as n — > +00 



\K\- 1/2 (^(3) - T 2 ($)HAn| («»($) -^X^*)) = \A n \-^ 2 (T{(^)-T^)) AO, 
where X A ($) is the random vector defined for all j = 1, ... ,p by 

(X An (*)) := -J- / J] ^(x^^N^ff*). 

J |A„| ja„xm |A„| xm£$A 

By using the ergodic theorem and the GNZ formula, one has Pe*— a.s. as n — > +00 

(X An ($)) . -> E (/« (0 M - fcf (0 M ,<M*) e- v (° U ' $;9i )) . 

Finally, let us notice that for all (to, 93) € M x fl and for all j = 1, . . . ,p 





/f(0 m ,^) = ^- (h(0 m ,<p;8)e- v V m M) 



d0 o 

= hf ] (0 m , <p; 0*) e -v(v m \rfi*) _ h ( Q m ^ ^ ^(1) (0 ™^ ; r) e -V(o">;n. 

Therefore (X A (^)) ■ ~£jQ l >Q*) -Pe*— a.s. as n — > +00. This finally leads to the following 
convergence in probability, as n — > +00 

8.3 Proof of Proposition 4 

Let us first state a result widely used in the following. 

Lemma 19. For any bounded domain A and for any test function h 

E(I a ($;M*)I*a«) = 0. (37) 

The proof of Lemma 19 is omitted since it corresponds to the proof of Theorem 2 (Step 1, 
p. 257) of Billiot et al. (2008) by subsituting v j (x m \cp) by the test function h(x m , ip; 6*). 

For all n E N, the domain A„ is assumed to be a cube divided as A„ = Ujej Aj >n where for all 
j € J\ the Aj >n 's are disjoint cubes. So |A n | = |i7||Aj )n | = |J"||Ao,n|. Moreover, for all j £ J, we 
can decompose each Aj <n in the following way : 

Aj, n := [j A k (D n ) (38) 

where the Afc(.D n )'s are disjoint cubes with side-length D n and JCj. n C Z d . The side-length D n is 
chosen greater than D and as close as possible to D, leading to 

n _ lAnl 1 ^ 



\J\l/d |A„|V" I 
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This choice implies D n — > D when n — > oo and guarantees D < D n < 2D as soon as |A„| > |^7"|D d . 
The cubes A J: „'s are therefore divided into \K.j <n \ = \K ^ n \D~ d cubes whose volumes are closed to 
D d . Denoting JC n = U jeJ lCj,n, we have \IC n \ = \A n \D~ d = \ J\\JCj in \ and finally 

A„ = (J |J A k (D n ) = (J A fc (£)„). (39) 

From Proposition 3 and under Assumption [E2(bis)], one has for any j G J 

\A^ n \- 1/2 R A]:n (#;M„(*)) = lAj^-^Roo^n (*;h,0*) + op(l), 

where Roo,A j: „ (&;h,0*) is defined in (8). 

Therefore the proof of Proposition 4 reduces to the proof of the asymptotic normality of the 
vector {\Aj,n\- 1/2 Rco,\ jtfl ($! K 0*)) jeJ - Now 

|A ,„ 

|A ,„| 1/2 



lA^r 172 ^^ (S;M*) = |A 0) „r 1/2 ( I Hn - ^fjU A „ (<f>; 6*) T £{h; 9*) 



|An| 

1 



|,7| X 7 A . n ($; ft, 0*) - U A „ ($; 0* ) T £(ft; 0*)) 

^ £ ^,n,A fc (D„)(*;0*), (40) 



fce/C„ 



where for any <^s G 



'Al{D n ) ■= \J\ X lA k (D n ) (wh,0*) 

W jinAk(Dn) (<p;9*) = < l -U Ak{Dn) (^e*) T £(h;0*) if k G (41) 

Therefore, to prove a central limit theorem for the vector (| A J , n |~ 1 / 2 i? 00 .A J „ h, @*))j e ji 
it suffices to apply Theorem 21 (see Appendix A), where in its statement we choose Ti n ,k = 
(Wj tn _A k (D„)(^',9*))j e j, X n ,i = $Ai(u„) and P = \<J\- F° r this, we first have to specify the 
asymptotic variance matrix X, then to check the assumptions of Theorem 21. 

First step: computation of the asymptotic variance. 

Let us fix a cartesian coordinate system such that is the center of A„. We assume, without 
loss of generality, that \S\ is odd. Moreover, we can always choose an odd number \K.^ n \ of cubes 
Ak(D n ) in (55). Consequently, A ,„ may be centered at and each Ak(D n ) is centered at kD n , 
k G II 1 . Note that if \ J\ was even, each Ak(D n ) would be centered at kD n /2. So, in this system, 

A ii/d \ d 

K„ is a subset of Z , independent of D n , with |/C„| = | J\ \j\i/ dD elements. 
Set, for all k, k' G Z d , 



4 2 1, (AO := E (wS (Db) 0*) <>, (Db) (<&; 0*) 
Note that from the stationarity of the point process, we have E^\,(D n ) = E^ l \_ k ,(D n ), for 



I = 1,12,2. Moreover, under Assumptions [N4] and [E2(bis)], for any k G fC n and for any 
configuration tp, since D n > D, Wj±, D j (<p;9*), i = 1,2, depends only on <£a ( (£>„) f° r — fe| < 1 
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that is I G Bfe(l). As a consequence, if k' g B£(l), , D j ($;6>*) is a measurable function of 
^ A|(_d„)- This leads, for i, j = 1,2, to 

(42) 

From Lemma 19 and under [E2(bis)] then for any k <E II 1 and for i = 1,2, 

E (wi2<A.) (*! e *) l $ A|(D„)) = 0. (43) 
From (42) and (43), we deduce that, for I = 1, 12, 2, 

fc'eB^(l)^B®,(5„) = 0. (44) 
We are now in position to compute the covariance. For any i and j in ^7, from (40), 

cow(|A il „|- 1 / 3 il 00 ^ iB ($;h,e*),|A i , n |- 1 / 2 ie 00lA ,, B ($;h, 0*)) 

= ^ E (p^E w ^A^D n) (^e*)w j!nAk , iDn) (^enY (45) 

Let us first consider the case i = j. We may write 



E 



E E wWaoOW 2 ) 

V 1 keic n k'eic n / 

= ^-r( E 41(^+2 £ 4S?(ao+ E 4SwY 

:— oi '—S2 '■ — S3 

The following lemma will be useful to drop the dependence on Z) ra in each term Si, S2, S3 
above. 

Lemma 20. .For any i,j = 1,2, denoting Ao(r) = Ufc g ]Bo(i) ^fc( T ) f/ or some r > 0), we /iaue 
, 0*) ($-0*) W [i) (^■6*)W^' > ($-6*) 

VV Ao(D n ) \ v ' ° > VV A (D n ) 1 ' I ^A (D) ^' 17 ^ ^A (D) ^ ' ^ ' 

Proof. For any i = 1, 2, ^ is a linear combination of /a (A) an d Ua (_d„), which converge 

respectively in L 2 to /a (d) an d Ua (i (d) by [N3] and [E2(bis)], since D n — > F>. Thus W^ofZ) ) 
converges in L 2 to W* / n \ asn^oo. Similarly, for any j — 1, 2, VF-^. „ , tends in L 2 to IV-r ,„,. 
The convergence stated in Lemma 20 then follows. M 

Let us focus on the asymptotic of each term Si, S2, S3. 

Term Si : from (44), 

E ( E E 4Ii(^))=E E j©(ao. 



25 



Let ICi tn := Ki >n n (Uj e g/Q „ Bj(l)) and note that j^'H — > as n — > +00. Then, 

si= E E E E 

fceJC«,„VCi,„ fc'6B fc (i)nK,,„ fce/Q.„ fc'eB fc (x)nAC<,„ 



Since, 

1 1^ 



(because £) < D n < 2D and j^''"] — > 0), we obtain, asm +00 



bl irl 2^ o,k\ n ) ~vic T~ 2^ \ki D n)- 



|/C "' |/C "' fee fe(l) ' |/C "' fc6 B (l) 

From Lemma 20, 



fceBo(i) feeBo(i) 

Therefore, 

\h Sl ~w\ s 

^ |J| fc 6 Bo(l) 

Term S2: with similar arguments as above, we obtain 

S2 = E E 

k£K Z}n \K 1 , n k'elC n \!C,, n 
=0 

+ E ( E *®(ao+ E 42? (ao) 



=0 



Therefore, since — > and Z) < Z? n < 2D, 



11 11 fcGBo(l) 



Term 5 1 ,-! 



% = E E *&(a.) + E E 

fce/c„yc,.„ k'6B t (i)n(/c„\K,.„) fce/c„\/c,.„ fc'eB=(i)n(K„\/c,.„) 

v v ' 

=0 

Let /C n = (/C„ \ K-i^n) n (UjgafKnXKi n)^i(l)) an d note that j^pj — > 0, as n — > +00. Then, 

E E *S<(ao+E E 

fcefc„\K„ fc'eB fe (i)n(K„\/Ci,„) feeKn fe'eB fe (i)n(/c„\/c 4 ,„) 

" v ' 

:=A 3 



2G 



Since, 



n— >-\-oo 



o. 



we obtain, from Lemma 20, 



1 |/C„ \ /c r; 

<J3 r 



E *S(ao 



171-1 
1^1 



E ^>)- 



feGBo(l) fcSBo(l) 

Combining the three terms Si, 5*2 and S3, we have, as n — > +00 



i,n,A fc (D„) 



E 

feeBo(i) 



(46) 



When i 7^ 7, there are three main cases in (45), according to k,k' G /C; jn , k,k' G /Cj, n , or 
k, k' £ K n \ (ICi^n UK,j tn ). As for the case i = j treated before, the other situations involve non-zero 
correlations on edges sets like lCi in , which are negligible with respect to \K n \. The covariance is 
therefore equivalent, up to Df l \J\, to 



The simplification occurs as for the case i = j and, since |/Ci n | = |^, ra |, we obtain the asymptotic 
equivalent for the covariance (45) 



2 EM(D) + ^E${D) 



\J\ 



(47) 



Finally, from (46) and (47), we deduce that S 1 (6'*), defined in Proposition 4, corresponds to 
the asymptotic variance of (|Aj.„| _1 / 2 i? 00 .A J „ (3>; h,6*)) 

Second step: application of Theorem 21. 

We apply Theorem 21 with Z n>k = (Wj !njA ^ Dn) ) je j, X nA = $A i (D n ),P= \ J\ and S = 5^(0*), 
which is a symmetric positive-semidefinite matrix as the limit of a covariance matrix (from the 
first step of the proof). 

The assumption (54) holds from [N4], [E2(bis)] and because D n > D. Assumptions (i), (ii) 
and (in) are direct consequences of [E2(bis)], [N2] and Lemma 19. It remains to prove (iv). 
Assuming X = (S^) for 1 < i, j ' < p, from the definition of the Frobenius norm, we have 



E 



\ICn\ 1 E E Z„ : fcZ„ : fc' T — £ 

fce/c„ fe'eB fc (i)n/c„ 



v v 

^EE* 

i=l 3=1 



1 E E Wi,n,A fc (13 tl )Wj jn ,A J ,,(r» n ) - £»j 

fce/c„ fc'GB fc (i)nK;„ 



(48) 



Let us first assume that i ^ j are fixed and denote Y n ^(D n ) = VKi, n ,A fc (D„): ^n(^n) 
Efe'eBfcWn/C Wjf.n.A,,, (£>„)• We have 



E 



X E E W/ i,n,A fc (D„)W / j,n,A fc ,(D„) - 

keK n fc'GB fc (l)n/C„ 



= |/c„r 1 E 2 y„ jfe ( J D„)^(Z> n )-S i , 
fce/c„ 

<£;i + £; 2 + £;3 + ^4, 
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where 



771 \K-i,n\ -p, 



l^nl" 1 (^,fe(^™) 5 n(A l )-E(y n , fc (A i )5 I ';(^n))) 



l^r 1 J2 (Y n ,k( D n)S*(D n )-E(Y nik (D n )S*(D n ))) 



keiCj 



„ |/C„ \ (X^n U /Cj |T1 )| 
^3 = tt^—, X 



E 



\K n \ 

\K n \ U /C^Or 1 Y, (Y n ADn)S*(D n ) - V{Y ntk {D n )S k n {D n ))) 



E 4 = 



\)Ci,n\ 



E(Y n , k (D n )S k n (D n ))+ l -^j J2 nYnAD n )S k n (D n )) 



k£K, 



+ 



\K 

\IC n \ (K^n U ICj,n)\ 
\IC n \ 



E(y„, fc (D„)5^(D„))-S ij 

fee/c„\(/c I ,„u/c 3 ,„) 



The first three terms E\, E2 and E3 can be handled similarly. Let us focus on E\: 

\Kr, 

— rji ^ |*-i,n| 

fce/Q 



■£i < l/c^i" 1 ]T E|y nifc (D n )5*(D„)-y Tl ,j k (u)fir*(D)| 



J2 (Y n , k (D)S k n (D) - E(Y n . k (D)S'(D))) 



keKt 



£ l E ( y »,fe(£')5^( J D))-E(y n)ft (£i„)5^(D„))| 



(49) 



Up to the edge effects which are negligible with respect to |/Cj )71 |, (Y ny k(D)S k (D)) , is stationary 
when fc G /Ci. n , since in this case, from (41), W^a^d) = ^aI(d) does not depend on n. Therefore 
the second term in (49) tends to by the mean ergodic theorem. For a fixed n, we have also by 
stationarity (up to the edge effects) 

E \Y n , k {Dn)S k n {D n ) - Y nM {D)S k n {D)\ = E \Y nfi (D„)S°(D n ) - Y nfl (D)S° n (D)\ 



E 



W 



(i) 



W 



(i) 



,w4 2) 



A (£>„)"a (D„) Aq(-D) A (£>) 



where A (D n ) = L>k'£B {i) ^fc' (D n ). From Lemma 20, this term tends to 0, therefore the first term 
in (49) asymptotically vanishes. The same argument shows that the third term in (49) also tends 
to as n — r 00. As a consequence, E\ tends to 0. 

The same decomposition as in (49) may be done for E2 and £3, which leads by similar arguments 
to E-2 — > and E3 — > 0. The last term E4 involves the difference between and its empirical 
counterpart. The same calculations as in the first step of the proof shows that E 4 — > 0. 

Therefore, we have proved that the terms in the double-sum (48) corresponding to i 7^ j 
asymptotically vanish. The same result can be proved similarly when i = j. Thus assumption (iv) 
in Theorem 21 holds and the convergence in law is deduced. 
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8.4 Proof of Proposition 6 

We can decompose A„ in the following way : 

A„:= |J A k {D n ) 



keK„ 



where the A^'s are disjoint cubes with side-length D n and JC n C Z rf satisfies \JC n \ = \A n \D n d . 
Similarly as in the proof of Proposition 4, we choose 



D J 

which implies D n — > D when n — > oo and guarantees D < D n < 2D as soon as |A„| > D d . 
From Proposition 3 and under Assumption [E2(bis)], for all i = 1, . . . , s, 

\A n \~ 1/2 RA n ($;hi,6 n (*j) = \K\- 1/2 Roo,A n ($;hi,6*) + o P (l) 

= \A n \-^ 2 (7 A „ ($; hi,8*) - U A „ ($; 8*?£(hi;8*j) + o P (l) 



= n Lr ii/ 2 S ^A fc (D„)(^;fai,g*) + op(l), 

where for any ip G SI 

^a,(d„) (v; fc, 0*) == ^Ak(D n ) (v; fc, e*) + u Afc(Al) (v; 0*f £(&,•; 0*). 

We apply Theorem 21 in the simpler case when /„ fe = / for all n e N and all k € /C„. 
If D n = D for all n, this framework would reduce to a stationary setting similar to Theo- 
rem 2.1 in Jensen and Kiinsch (1994). But as A„ is allowed to increase continuously up to 
M. d , D n = D is impossible. We will therefore apply Theorem 21 in Appendix A with Z„.fc = 
(W Ak (D n ) {&;hj,6*)) j=1 _ a , X n .i = $ Ai (D„) andp = s. 

Let us first compute the covariance matrix of (|A„| _1 / 2 i? 00: A ri ($; hi, #*)) ._ 1 . By the same 
calculations as for the term Si in the proof of Proposition 4, we obtain 

co W ^|A„|- 1 / 2 i? COiA „(*; (^,^),|A„|- 1 / 2 i? oc , An 

~^ E E^a^)^;^,^)^^^;^,^))- (50) 
fceBo(i) 

The asymptotic covariance matrix is thus S 2 (0*) defined in Proposition 6. We can now apply 
Theorem 21 in the appendix with X = S 2 (0*). The assumption (54) hods because D n > D 
and from [N4] and [E2(bis)]. The assumptions (i), (ii) and (m) follow from [E2(bis)], [N2] 
and Lemma 19. Assumption (iv) may be checked easily as in the second step of the proof of 
Proposition 4. by using (50). 

8.5 Proof of Lemma 8 

For simplicity let Ya := Y A ($;0). Let us denote A(S,D V ) := Ui 3 -i<rDv mi Aj(5). From the 
additivity property of Y, proving Lemma 8 reduces to prove that for any 5 > and any D v > D, 
D d A(S,D v ) = S d A(D, D) where 



A(6,DV);=E Y Ao(5) Y S((5i0 



29 



Since D v > D, we can write A(S,D V ) = A(S,D) U A', where A' C (A(S, D)) c . From the 
locality assumption, Ya' is only a function of ^a^*)- So 

E (y AoW Y A ' T ) = E (E (Y Aa(&) Y A ?\<S>^ {&) )) = E (E (Y Ao(a) |$ AS(5) ) Y A < T ) = 0, (51) 

which yields A(S,D V ) = A(5,D). By denoting A(S) := A{5,D y ) = A(S,D) and A(S) := A(S,D), 
we must prove D d A{8) = 5 d A(D). 

Let us first assume 5 = kD with k e N. We may write A(kD) = (A (kD) ® D) U A' and 
may assert that Ya' depends only on a function of ^Af,(kD)- By a similar argument as in (51), we 
obtain A(6) = E(YA (fen) ^A (kD)®D T )- From the disjoint decomposition A${kD) = Uj£/cAj(D) 
where |/C| = k d , we have, by the same decorrelation argument as above and by stationarity, 

A(S) = ^E(Y Aj(£1) Y Ao(t;D ) eD T ) = ^ E(Y A . (D) Y Aj(D)e /) 

jeK jeK 

= k d V(Y AoiD) Y Ao{D)@D T ) = ^A(D). 

Let us now assume D = k5 with k € N. First notice that in this case A(D) = A(<5)© -j (1 — 1/fc). 
The following decomposition holds: Aq(D) = Uj 6 /cAj(<5) where \K.\ = k d . For any j € /C, |j| < 
-j(l — 1/fc), so A(D) contains any translation of the set A(5) with respect to j. Let us denote this 
translated set by tjA(S). From the same decorrelation argument as above and by stationarity, we 
have 

A(D) = ]T E(Y Aj(6) Ya (Z3) t ) = ^ E(Y Aj(5) Y TjS(5) t ) = £ E(Y Ao(5) Y S(5) T ) = ^(<J). 
jetc jetc jeic 

(52) 

Let us now consider the case D/S = k'/k, where (k, k') £ N 2 . Let 5' = 5/k, then D = k'S' 
and according to (52), 6' A(D) = D d A(6'). In the same way as we have proved D d A{5) = S d A(D) 
when 6 = kD, this is not difficult to show that for any 5 = kS' with 5' < D, S' d A(5) = S d A(S'). As 
a consequence when D/S = k'/k, we obtain 

A(D) = ^A(S>) = *A(S). (53) 

In the general case, one may find a sequence of rational numbers (q n )neN which converges to 
D/S. Let S„ = q n D, we have from (53), A(D) = ^ T A(S n ). Since we have assumed E(Yp ) — > 
when r„ — > 0, the additivity of Y and S n — > S yield 

A(S n ) = E (Y Ao(M Y S(M T ) E (y Ao(5) Ya W t ) = A(S) 

as n goes to infinity. Therefore, the identity (53) holds for any S > 0, which concludes the proof. 

8.6 Proof of Proposition 7 

The proof follows arguments presented by Jensen and Kiinsch in Jensen and Kiinsch (1994). Let 
C n (S) = [-riS -S/2,nS_+S/2} d , so C n (5)_= U k&Kn A k {S), where fC n = [-n,n} d CiZ d and A k (S) is 
the cube centered at kS with side-length S. We have 

yar(|C„(5)|-V 2 Y Cn(I) ($;^)) =\C n (S)\~ 1 E E (y^ ($; 0*) Y Aj(S) (*; 

= ICnisr 1 E E E ( y a,w y a j( ?) (<M*) T ) ■ 

!£C »3eB,([fl)nc„ 



30 



-=d. 



Since |C„(<5)| =6 |/C n |, from the ergodic theorem, 
Var(\C n C6)\- 1,2 ^cM^ e * 



E E(Y Ao( , ) ($ ; r)Y Afc( , ) ($ ; rf 
*i<r#l 



which is M(fl*) by Lemma 8. 

Therefore, to prove that M(6**) is positive-definite, it is sufficient to prove that the covariance 

matrix Var ^|C„(<5)| _1 / 2 Y Cm ^($; 9*)^j is positive-definite for n large enough. Let x e M. q \ {0}, 

we must show that 

V := ^Var (\C n (S)\~ 1/2 ^c n Cs)^ **)) x > °" 
Since, for two random variables X, X' with finite variance 

For(X) = E(Var(X\X')) + Var(B(X\X')) > E(Var(X\X')), 



we have, by denoting L := I 2 



1 Z d 



E E 



CM-WviVarf E E Y Ai(3) (*;fl*)|* Afc(3)> ft^I 



=s» „(*) 



There- 



Note that from the locality property, S^„(G>) depends only on ^a-(5) ^ or ^ ^ f (2 
fore, conditionally on 3> Ajt (a\, k ^ L, the variables S^ n ($) and Sf,„($) (for £ ^ are independent. 
Now, let A((5) := U^.^j-Di Aj(<5), from the stationarity we have for n large enough 



V > IC^S)]- 1 ^ J2 E [ Var (S/,n(*)|* Afc (!) ! 



feLn/c„ 



k i L 



5^\LnlU xE , Va , ^ I* ! < | fc | < 2 



> «(?,£), d) x E f^ar (^Y^ ($;**) |* Afc(J) ,l< |*| < 2 



where n(S,D,d) is a positive constant. Assume there exists some positive constant c such that 



c when the variables < f Afc (j),l < |fc| < 2 



are fixed to belong 



P fl .-a.s. x'Y^^ff*) 
to -B, where £? e T is involved in [PD]. It follows that for any tpf € A; for i = 0, . . .,£ (with 
£ > 1), where the A's come from [PD], x 7 (y a ^ (<^;0*) - Y s ^ {<p ;8*) \ = 0. Since for all 

(tp ,...,<pi) £ A x . . . x A e , the matrix with entries ( Y S(7) (W^*)) . _ ( Y a(7) i'Po',0*)) . is 
assumed to be injective, this leads to x = and hence to some contradiction. Therefore, when 



the variables <£> 



, 1 < |fc| < 2 



are for example assumed to belong to B, the variable 
x^Y^p^ ($; 0*) is almost surely not a constant and so V > 0, which proves that M(0*) is a 
symmetric positive-definite matrix. 
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8.7 Proof of Proposition 9 

Since for any ip £ 0, M n (y>; -,<5, Z) v ) is continuous in a neighborhood V(#*) of 9* and according 
to [El], it is sufficient to prove that for any 9 <G V(9*), M„(<&; 6, 5„,D V ) converges in probability 
towards M(9). 

We choose the sequence S n as follows : 



Sn = 



I A, 



ii/d 



I A, 



5 J 



which guarantees 5 no = S, since |A„ |<5~ d e N, 5 < S n < 25 for n sufficiently large, and 5 n — > 5 as 
n — > oo. This choice allows us to consider, for all n £ N, the decomposition A„ = Ufc£jc n A/.(5 n ), 
where the Afe(<5„)'s are disjoint cubes with side-length 5 n and centered at kd n . Moreover, since 

5 n > 5 and 5 n — > 5, we have 
of the proof. 



Let K n ■= IC n n (Uj G aK„Bj 







"d v 1 






s 










5 


))■ 



when n is large enough, which is assumed in the sequel 
Since |A„| = £^|/C„|, we have 



5ZM n (<Z>;9,5 n ,DV)-5 d M(9) 



<X 1 +X 2 +X 3 +X i 



where by setting Afc(r) = Uj C:Efc ( I""/ 1 ) A? ( T ) (f° r some T > 0), 

l^"!" 1 E E Y„. Afc(5n) ($;0)Y n!Aj(5n) ($;i 

fcefc„ jeB fc (|~^"|)n/c n 



Xo 



X* 



keK. n \K„ 

\Kn\~ 1 J] ( Y -A fc ( 5 „) (*;<?) Y„ >Sfe((5n) (*;(9f - Y AkW Y Sfc(5) (4>;0) T 

kelC„\K n 



Xa 



We have from the additivity and the stationartiy of Y„ 



-E(Y Ao(5) (<M)Y So(5) (d>;0) T 



E|Xr| < i^r 1 E E E 

fee/c„jeB fc ([^])n/c n 



Y 



Y„, Ao(5n) ($;^)Y n)So(ati) ($;i 



I I 

which tends to as m +oo, because j^Aj — > and 5 < 5 n < 25. Therefore, X\ converges 
in probability to 0. The second term converges also to in probability from the additivity of Y 
and Y and from (16). The expectation of the third term converges to by following the proof 
of Lemma 2(). Finally, from the stationarity of Y and since \K n \ ~ \JC n \ IC n \, the mean ergodic 
theorem applies to E|_X"4|, which, in particular, shows that X^ — > in probability. This proves that 

6%M n ($;9,5 n ,D v ) S d M(9), 

in probability, as n — > oo. Since 5 n is a deterministic sequence converging to <5, the conclusion of 
Proposition 9 follows. 
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A Central Limit Theorem 



The following result is a central limit theorem for conditionnally centered random fields. It gener- 
alizes Theorem 2.1 in Jensen and Kiinsch (1994) to a non-stationary and non-ergodic setting. A 
general result has been proved by Comets and Janzura (1998) for self normalized sums, provided a 
fourth moment condition. Our result is in the same spirit but it is proved for triangular array and 
without self-normalization, which is well-adapted to the residuals process framework. This allows 
in particular to avoid the fourth moment assumption. 

Theorem 21. Let X nt i, n 6 N, i G 7L d , be a triangular array field in a measurable space S. For 
n EN, let K. n C 1 d and for k € JC n , assume 



Z n ,fe — fn.k (X nt k+i, i & Iq) > 

where I = {i£ Z d , \i\ < 1} and /„, fc : S x ° -> MP. Let S„ = EkeK„ Z «^- V 
(i) c 3 := sup„ eN sup feeKn E|Z„, fe | 3 < oo, 
(it) Vn € N, Vfc e Kn, E(Z n! fc|X„ ii; j^k) = 0, 
(Hi) \JC n \ — !> +oo as n — > oo, 

(iv) There exists a symmetric matrix S > such that 



(54) 



E 



keK n jeB k (l)n/C„ 



_ v 



0. 



as n — > oo . 



then l/Cnl-^Sn -A JV (0,E) 
Proof. 

Let us first assume that £ is a positive-definite matrix (i.e. S > 0). According to the Stein's 
method (see also Bolthausen (1982)), it suffices to prove that, for all e 6 MP such that ||e|| = 1 and 
for all A G R, 

E ((iA - ^\Kn\- l/2 ^r 1/a S n ) cxp (iAe T |X;„|- 1 / 2 S- 1 / 2 S„)) -> 0. 
Denoting u = Ae, this is equivalent to prove that for all u € MP, 

/ \ 



E 



\ 



:=A 



/ 



We decompose the term A in the same spirit as in Bolthausen (1982), Jensen and Kiinsch 
(1994) and Comets and Janzura (1998). Let us denote by I p the identity matrix of size p and 



s n - EjeB fc (i)n/c„ Z 
E(Ai — A 2 — A3) where 



jeTB k (i)nK n "n,j- Noting that ir £ ' S„ = S„ S ' u, the decomposition is E(A) 



Ai = iexp(iif l/Cnl-^S-^Sn) 



ip-i^r 1 ^- 1 / 2 £ Z„, fc S^ 



fee/c„ 



A 2 = exp(iif l/Cnl- 1 /^- 1 /^^!^!- 1 /^- 1 ^ 

]r z„, fe (1 - cxp(- i u T i^ l r i /2s- i /2 S fe ) _ i^r^s-vx) , 



fce/c„ 



A 3 = |^ l |- 1 / 2 £- 1 / 2 £ Z n , fc exp^|^ i r 1/2 S- 1 / 2 (S ri -S,J 

feG/C„ 
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The two last terms A2 and A3 can be handled as in Jensen and Kiinsch (1994): E(Aa) = by 
(ii) and |E(A2)| — > from the same inequalities therein and the sub- multiplicative property of the 
Frobcnius norm. These inequalities rely on two facts: \/x € K, |1 — e~ %x — ix\ < x 2 /2 and for all 

n and for all (k u k 2 ,k 3 ) G IC n , E(\Z nM \\Z nM \\Z nM \) < (E(|Z B , fcl | 3 )E(|Z„, fc2 | 3 )E(|Z„^ 3 1 3 )) 1/3 
which is less than C3 by (i). 

For Ai , we cannot use a mean ergodic theorem as in Jensen and Kiinsch (1994), but Assumption 
(iv) is sufficient. Indeed, 



|E(Ai 



< llullE 



< u 



l p - \Jc n \-^ 2 Yl E z„, fc z„/s-v^ 

fce/c„ jeB fc (i)n/c„ 



-1/2 



E 



\fc n \ 1 2^ E z nj feZ nj j T — s 

fee/c„ jsB fc (i)n/c„ 



which tends to by (iv). 

Now, if £ is not a positive-definite matrix, one can find an orthonormal basis (fi, . . . , f p ) of R p , 
where the f,'s are eigenvectors of S. We agree that, if r < p denotes the rank of 52, then (fi, . . . , f r ) 
is a basis of the image of 52, while (f r +i, . . . , f p ) is a basis of its kernel. 

Let us denote by V Jm the matrix whose columns are (fi, . . . , f r ) and V Xer the matrix whose 
columns are (f r +i, ■ ■ • , f p ). Similarly, for any u G W, let us denote by Ui its z-th coordinate in the 
basis (fi,...,fp) and u /m = (u 1} . . . , u r ), u Ker = (u r +i, . ■ ■ , u p ). Hence u = Vf TO Uj m +V Ker u Ker . 

The convergence in law of |/C„| _1 / 2 S„ to a Gaussian vector reduces to the convergence of 
u T |K„|- 1/2 S„ for all u G W. We have 



U T |/C„| 1 ^ 2 S„ = U/ m T V Jm T |/C„| l ' 2 S n + U K er T ¥_Ker T \fcr 



From (iv) and since V Kc 



52 y_ Ker — 0, we deduce that 



E \K n \ 



1 E 

fce/c„ jeB fc (i)n/C 



E Yirer T Zn,fcZ„ > /V ii:e 



-l/2( 



(J. 



(55) 



which means that V Ker T |/C n | _1 / 2 S„ tends to in quadratic mean. 

On the other hand, the assumptions of Theorem 21 imply that (i) — (iv) remain true when one 
replaces Z n ^ by V Im T Z ni fc and 52 by V Jm T 52 V Jm . Since V Jm T 52 V Jm is positive-definite, the 
convergence in law of V Jm |/C„| -1 / 2 S„ holds for the same reasons as in the first part of the proof. 

Therefore, we have proved that for all u G W, u T \K. n \-^ 2 S n -A Af (o, u /TO T V Jro T 52 V Jm Uj 



It is easy to check that u/ m V /m 52 V Jrn Uj m = u Su, which concludes the proof. 



B Assumption [PD] on two examples 

In this section, we focus on the two following models, belonging to the exponential family 



1. Two-type marked Strauss point process : M = {1, 2} and for 9 = l0\, 6\ 1 O^ 1 , O^ 2 , 2 ' 2 



for any A G B(R d ), 

v K (y ] e) = 6\ w\\ +e\ w\\ + ]T e^> 



E 



1 [0 Dmi , m2] (||a;2 - Xi\\) . 



~«A, l(v) 



m 1 ,m 2 = l 
'■= V A,AV> mi<m 2 
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Alternatively, 

2 

V(x m \<p;6)=6? + 6 2' m ' E lp ™,^- 2 ](ll2/-^ll)- 

m' = l y™' £<p 

This process is well-defined when 2 1,T ™ 2 > and jj™ 1 '™ 12 — o (inhibition assumption), or when 
^mi, m 2 g jg>2 an( j j^m 1 ,m 2 _ ^ > g ( nar( j_ core assumption), see Proposition 13 for instance. The 
range of the local energy function equals D = max (D 11 , D 1 ' 2 , D 2,2 ). 

2. Area interaction point process : M = {0} and for R > 0, 6 = (61,62) and any A € £>(M rf ), 

V A (tp;6) = 6i\cp A \ +62V 2 {'Pa), with v 2 ((p A ) := |Ua. Gv , A B(x, . 

Alternatively, 

«i(0|p) := 1, «2(0|p) := U a , G ( ¥ , s(0 3fl)U{0} )B(a;, fl) \ U KevB(0 2H) 6(x, H) 

This model is well-defined for € M 2 (see Proposition 13 for instance) and the range of the local 
energy equals D = 2R. 

Both these models satisfy [C] and [Nl-4]. The aim of the sequel is to prove Proposition 18, 
which claims that 5^(0*) and S 2 (0*), involved respectively in Proposition 4 and 6, are positive- 
definite for these models, when considering the maximum pseudolikelihood estimate for 6 n and the 
two following frameworks 

• Framework 1 (for S 1 (0*)): we consider the inverse residuals (h = e v ). Let us recall that 
Proposition 17 asserts that [PD] fails for the raw residuals (h = 1) for both the area- 
interaction and 2-type marked Strauss models. 

• Framework 2 (for S 2 (0*)): we consider the family of test functions given for j = 1, . . . , s and 
< r\ < . . . < r s < +00 by 

hi{x m ,wff) = l [0>rj] (d(x m ,cp))e v ^ m ^ e \ 

related to parametric and nonparametric estimations of the empty space function at distance 
r r 

When considering the MPLE, Roo lA ((p; h, 6*) is given by (26) with LPI/ 1 * 1 , H and £ respectively 
given by (24), (25) and (7). 

B.l 2-type marked Strauss point process 

We only deal with the inhibition case, that is = M. 2 x and Z)™ 1 '" 12 = . The following proofs 
could easily be extended to the hard-core case and to the multi-Strauss marked point process (see 
e.g. Billiot et al. (2008)). For any vector z of length 5, we sometimes reparameterize it similarly 

as the parameter vector, that is z = (z\,z 2 , z\' 1 7 z^ 2 , z^' 2 ) . 

B.l.l Proof that 5^(0*) is positive-definite for the two-type Strauss model 

From Proposition 7, proving that S 1 (0*) is positive-definite in Framework 1 reduces to check 
Assumption [PD] with h = e v and 

(i) Y A (<p;6*)=I A (<p;e v ,6*), 

(ii) Y A (<p;6*)=R oo , A (<p- ) e v ,0*). 
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(i) is ensured by Proposition 16. 

(m) Wc fix S — D and B = in [PD]. Let il := 00. Without loss of generality, one may assume 
that 6*2 1,1 > 0. Let us define for n > 1 

A n ,_(r?) = {if G H : p(Ao® x {1}) = 2n, ^(A (d) x {2}) = 0, 

p (B ((0,0), I)) = n,<p(B ((U 1 - 1 -2,0), J))= „} , 

A„, + (r?) = g O : ^(Ao(3) x {1}) = 2n, fp(A (3) x {2}) = 0, 

p (B ((0,0), I)) = n,<p (B ((D^ + \ ,0), I)) = n} . 

Let <^ ra ._ G and y n ,+ G A„. + . Then for 77 small enough 

- ( 2ne e * 1+( - 2n -^ e i 1 ' 1 if« = - 



(lPL^ **)) = /_ «r'(x m |^,.)e- V(xm|p - ;9 *V(^ m )- 

V 71 JAxM 

(LPll 1) (9 3ni _;r)) miX = / v^ m \x m Wn,-)e- v ^ m \^ *^{dx m ) 

^ '2 ./A vl\J 



2n ifm' = l, 
if m' = 2. 



2 JAxM 

{2n(2n — 1) if mi = m-2 = 1, 
otherwise. 

LPL«(^ +; r)) ' = /_ v^ m *{x m \<p n , + )e- y ^ ^> e ^(dx m ) 

{n(n — 1) if mi = m-2 = 1, 
otherwise. 

Now, 

A - R oo,x(^n,->Vr l) +) := i^xOn,-;^,!?*) - # 00 ,x(v>n,+ ; e v , 0*) 
= 2n (e^+^-D'S 1 ' 1 - Jf+Pn-Ve?- 1 ") + (W(e v ,6*)) 1 / (2n(2n - 1) - n(n - 1)) 

= 2n e e "+("- 1 ) e "' 1 (1 - e " e " a ) + n(3n - 1) (W(e y , e*))^ 1 +/(*>„.-, ¥>n,+, W, v). 

v J 

V 

Fix e > 0, there exists no > 1 such that for all n > rio, x n < —e. Now by a continuity argument, 
there exists 770 = 770(710) such that for all rj < 770(710), \f(ip n0t -,ip no ,+ ,W, 77) | < e/2. Therefore by 
assuming that Ai?^ j(<ys„ 0i _, <y9„ . + ) = 0, wc obtain for 77 < 770 

= l A -Roo,A^«o,-^no, + )l > Ko\~ l/Ono,-, <£»o,+, W, 77)| > e/2 > 

which leads to a contradiction and proves [PD] . 

B.1.2 Proof that £ 2 (#*) is positive-definite for the two-type Strauss model 

From Proposition 7, proving that S 2 (0*) is positive-definite in Framework 2 reduces to check 
Assumption [PD] with 

Y A (v;0*) = Roo,Ateh;0*), 
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where, for all j = l,...,s, (R OCj A(<y5; h; = Roo,A(f',hj,8*), hj is the test function given by 

hj{x m , ip; 9) = l [0 . rj] (d(x m , ip))e v ^ m ^ e \ We fix as before 5 = D and B = in [PD]. 

Let < r\ < . . . < r s < +oo. Let us also assume that r, ^ D for i = 1, . . . , s and define 



4- fa) + ^ A °( D » ^ 2 ^{ B ((0. °)' |) >< {!>) = 1, V (B (fa - f ,0), ? ) 



.4 



ftfo) = {<p G n : ^(A (£>)) = 2,<^ (s ((0, 0), |) x {!}) = 1, ^ (fi (fa + |, 0), ? ) 



)x {!}) = !} 



)x {!}) = !} 



Let tp i>9 G A/, (r?) for 



and i= 1, . . . , s. Let «j the constant given by 

J 2e^ 1+fl 3 1,1 if n < D, 
2e 1 otherwise. 



Then for i,j = 1, . . . , s and for 77 small enough 

h j (x m ,i Pi , + )e- v ^ m ^+^n(dx m ) 



I AxM 



'AxM 








On the other hand 



LPL^V,,.;^ 



vf{x m \tp h .e- v{xm ^'<'- r) f x(dx m ) - 



AxM 



if i < j, 
otherwise. 

if i < j, 
otherwise. 



if m! 



(LPL«(^,.;i 



mi,m2 
2 



u 2 2 (x l^.e 



if m! = 2 



A • M 



2 if mi = TO2 = 1 

and Ti < D 
otherwise. 



Let xel s \ {0} , then from previous computations 

^ ( R oo.a(^+; h, 0*) - R^r^-ih, 6*)) = 2k iXi + f(x, <^,+ , h). (56) 



By using a continuity argument, one may prove that for every e > there exists 77 > such that 
|/((x, , <Pi ~, h)| < e. Therefore, assuming that the l.h.s. of (57) equals leads to Xi = for 
i = 1, ...,s. 

B.2 Area-interaction point process 

We fix for simplicity d = 2, though the proofs may be extended easily to higher dimensions. 

B.2.1 Proof that 5^(0*) is positive-definite for the area-interaction model 

From Proposition 7, the proof reduces to check Assumption [PD] with h = e v , 5 = D, B = and 

(i) YA(<p- > 6*)=I A (<p;e v ,e*), 

(it) Y A (i P ;e*)=R 00 , A (cp;e v ,e*). 

Again (i) is ensured by Proposition 16 since this model satisfies [Exp] . 
(ii) Let us consider for some 77, tu > the two following events: 



A 1 {r 1 ,u) := { V Gn: V (A (*)) = 2, V (B((0 ) 0),i 7 )) = l,^(B((0,w),» 7 )) = l} 
A 2 ( ?7 , w ) := {^en:^(Ao(5))=3 >V (B((0 ) 0),» 7 )) = l,^(B((0,w),» 7 )) = 2} 
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Fix r),u>, let tpj G Aj(r),ui) and denote by e v (y>) := Exe^ e v( - x ^\ x ' 6 ^ 



When i] ->• 0, 



/ x ( W e^,0*) = |A|-e^)- 



e^(^i) -> 2e^ + ^* 9 ^ and e^ 2 ) -> 2e^ + e^+^sM 
where g(u) := \B((0,0),R)uB((0,u),R)\-\B((0,Q),R)\. Moreover, by denoting v 2 (<p) = E x&v «a(aM 



2 if j = 1 

3 if j = 2 



(LPL£>fo;0*)) = [_v 2 (x\ Vj )e- v ^^dx-v 2 (Vj) 



2 JA 

Again, when 77 — >■ 0, one may note that for fc = 1, 2 



Wfc(£|pi)e- y ^ lie *:>cfo - v^x^e'V^^dx -> 



and 

^2(^1) -> 2g(w) and ^2(^2) -> 

These computations lead to 

R 0O ^ 1 ;e v ,8*)-R oo ^ 2 ;e v ,9*) = 2S - e «f+*Si>M - (W(e v , (!>*)), + (W(e v ,6>*)) 2 

+ /(^1,<^2,W), 

where the function / is such that for all e > 0, there exists 77 small enough such that \f(<pi, ip 2 ,W)\ < 
e. Let tpj G A, (77,0), then, since <?(0) = 0, assuming that the l.h.s. of the previous equation equals 
leads to (W(e v ,0*)) = e e *. Now, let u) > and again assume that R (X} -^(ipi;e v ,6*) = 
R x x(^2; e V i 0*), we therefore obtain (by the continuity argument) 

e ^+0 2 * g (aO _ e «i 

(w(e".«')) 3 = ,(„) ■ 

But (W(e v , 9*)) 2 is a constant and so cannot depend on uj. Therefore one of the assumptions 
made before is untrue, which proves [PD]. 



B.2.2 Proof that 5] 2 (#*) is positive-definite for the area-interaction model 

From Proposition 7, it suffices to check Assumption [PD] with 

Y A (w0*) = Roo 1 Afo>;h;0*) 1 

where, for all j = l,...,s, (R 00j a((/?; h; 6*)). = R QO _A(ip;hj 1 9*), hj is the test function given by 
hj(x m ,ip; 6) = l [o , rj] (d(x m ,(p))e v( - xm ^' \ and where, again, we choose 6 = D and B = 0. 

The proof is quite similar to the one proposed for the 2-type marked Strauss point process (see 
B.2.2). Let < 7*1 < . . . < r s < +00. Let us also assume that fj ^ D for i = 1, . , . , s 

Ai,-(V) = ^6n:^(A (5)) = 2 jV (B((0 J 0),2)) = 1, p (b ((r< - |, 0), |)) = l}, 
Ai,+ (V) = [p £«:#o(5)) = 2,<p (b ((0,0), I)) =1,^(^(^ + 1,0), J)) = l}. 
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Let i,j G {1, . . . , s} and k G {1, 2}, let (pi t _ G and ^ + G A;. + , then 

(lPL^^..;**)) = f_v k (x\^.)e- v ^'^dx- v fc (*lw \ x), 

A xe^i,. 

for • = -,+. It is expected that for small jj, (lPL^^ _; 6>*)^ ~ ^LPL^ (<Pi,+ ; and 
e^fai,-) ^ e^(<Pi,+) ^ «i := 2e e i*+ e 2ie(o,i?.)ui3(r t ,i?.)|^ Lct x g RS \ | | ^ thcn from prcvious 
computations 

^ ( R oo,A(^,+ ; h ^*)- R oo,A(^,-; h ^ ,t )) =2K i X i + /(x,^ ii + ,^ i _,h) (57) 

where for every e > there exists 77 > such that |/((x, <fi+, ¥>j,-, h)| < e. Therefore, assuming 
that the l.h.s. of (57) equals leads to Xj = for i = 1, . . . , s. 
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